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Description 

[0001] This application claims the benefit of U.S. Provisional Application No. 60/300,894, filed June 26, 2001 , and is 
a continuation-in-part of U.S. Serial No. 09/684,670, filed October 6, 2000. 

5 

Background Of The Invention 

[0002] Throughout this application, various publications are referenced in parentheses by author and year. Full citations 
for these references may be found at the end of the specification immediately preceding the claims. The disclosures of 
10 these publications in their entireties are hereby incorporated by reference into this application to more fully describe the 
state of the art to which this invention pertains. 

[0003] The ability to sequence deoxyribonucleic acid (DNA) accurately and rapidly is revolutionizing biology and 
medicine. The confluence of the massive Human Genome Project is driving an exponential growth in the development 
of high throughput genetic analysis technologies. This rapid technological development involving chemistry, engineering, 
15 biology, and computer science makes it possible to move from studying single genes at a time to analyzing and comparing 
entire genomes. 

[0004] With the completion of the first entire human genome sequence map, many areas in the genome that are highly 
polymorphic in both exons and introns will be known. The pharmacogenomics challenge is to comprehensively identify 
the genes and functional polymorphisms associated with the variability in drug response (Roses, 2000). Resequencing 

20 of polymorphic areas in the genome that are linked to disease development will contribute greatly to the understanding 
of diseases, such as cancer, and therapeutic development. Thus, high-throughput accurate methods for resequencing 
the highly variable intron/exon regions of the genome are needed in order to explore the full potential of the complete 
human genome sequence map. The current state-of-the-art technology for high throughput DNA sequencing, such as 
used for the Human Genome Project (Pennisi 2000), is capillary array DNA sequencers using laser induced fluorescence 

25 detection (Smith et a!., 1986; Ju et al. 1995, 1996; Kheterpal et al. 1996; Salas-Solano et al. 1998). Improvements in 
the polymerase that lead to uniform termination efficiency and the introduction of thermostable polymerases have also 
significantly improved the quality of sequencing data (Tabor and Richardson, 1987, 1995). Although capillary array DNA 
sequencing technology to some extent addresses the throughput and read length requirements of large scale DNA 
sequencing projects, the throughput and accuracy required for mutation studies needs to be improved for a wide variety 

30 of applications ranging from disease gene discovery to forensic identification. For example, electrophoresis based DNA 
sequencing methods have difficulty detecting heterozygotes unambiguously and are not 100% accurate in regions rich 
in nucleotides comprising guanine or cytosine due to compressions (Bowling et al. 1991; Yamakawa et al. 1997). In 
addition, the first few bases after the priming site are often masked by the high fluorescence signal from excess dye- 
labeled primers or dye-labeled terminators, and are therefore difficult to identify. Therefore, the requirement of electro- 

35 phoresis for DNA sequencing is still the bottleneck for high-throughput DNA sequencing and mutation detection projects. 
[0005] The concept of sequencing DNA by synthesis without using electrophoresis was first revealed in 1 988 (Hyman, 
1988) and involves detecting the identity of each nucleotide as it is incorporated into the growing strand of DNA in a 
polymerase reaction. Such a scheme coupled with the chip format and laser-induced fluorescent detection has the 
potential to markedly increase the throughput of DNA sequencing projects. Consequently, several groups have inves- 

*o tigated such a system with an aim to construct an ultra high-throughput DNA sequencing procedure (Cheeseman 1 994, 
Metzker et al. 1 994). Thus far, no complete success of using such a system to unambiguously sequence DNA has been 
reported. The pyrosequencing approach that employs four natural nucleotides (comprising a base of adenine (A), cytosine 
(C), guanine (G), or thymine (T)) and several other enzymes for sequencing DNA by synthesis is now widely used for 
mutation detection (Ronaghi 1998). In this approach, the detection is based on the pyrophosphate (PPi) released during 

45 the DNA polymerase reaction, the quantitative conversion of pyrophosphate to adenosine triphosphate (ATP) by sulfu- 
rylase, and the subsequent production of visible light by firefly luciferase. This procedure can only sequence up to 30 
base pairs (bps) of nucleotide sequences, and each of the 4 nucleotides needs to be added separately and detected 
separately. Long stretches of the same bases cannot be identified unambiguously with the pyrosequencing method. 
[0006] More recent work in the literature exploring DNA sequencing by a synthesis method is mostly focused on 

50 designing and synthesizing a photocleavable chemical moiety that is linked to a fluorescent dye to cap the 3-OH group 
of deoxynucleoside triphosphates (dNTPs) (Welch et al. 1999) (WO 00/53805, WO 91/06878, WO 01/92284). Limited 
success for the incorporation of the 3'-modified nucleotide by DNA polymerase is reported. The reason is that the 3- 
position on the deoxyribose is very close to the amino acid residues in the active site of the polymerase, and the 
polymerase is therefore sensitive to modification in this area of the deoxyribose ring. On the other hand, it is known that 

55 modified DNA polymerases (Thermo Sequenase and Taq FS polymerase) are able to recognize nucleotides with ex- 
tensive modifications with bulky groups such as energy transfer dyes at the 5-position of the pyrimidines (T and C) and 
at the 7-position of purines (G and A) (Rosenblum et al. 1997, Zhu et al. 1994). The ternary complexes of rat DNA 
polymerase, a DNA template-primer, and dideoxycytidine triphosphate (ddCTP) have been determined (Pelletier et al. 
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1994) which supports this fact. As shown in Figure 1, the 3-D structure indicates that the surrounding area of the 3- 
position of the deoxyribose ring in ddCTP is very crowded, while there is ample space for modification on the 5-position 
the cytidine base. 

[0007] The approach disclosed in the present application is to make nucleotide analogues by linking a unique label 
5 such as a fluorescent dye or a mass tag through a cleavable linker to the nucleotide base or an analogue of the nucleotide 
base, such as to the 5-position of the pyrimidines (T and C) and to the 7-position of the purines (G and A), to use a small 
cleavable chemical moiety to cap the 3-OH group of the deoxyribose to make it nonreactive, and to incorporate the 
nucleotide analogues into the growing DNA strand as terminators. Detection of the unique label will yield the sequence 
identity of the nucleotide. Upon removing the label and the 3-OH capping group, the polymerase reaction will proceed 
10 to incorporate the next nucleotide analogue and detect the next base. 

[0008] It is also desirable to use a photocleavable group to cap the 3'-OH group. However, a photocleavable group 
is generally bulky and thus the DNA polymerase will have difficulty to incorporate the nucleotide analogues containing 
a photocleavable moiety capping the 3' -OH group. If small chemical moieties that can be easily cleaved chemically with 
high yield can be used to cap the 3-OH group, such nucleotide analogues should also be recognized as substrates for 
15 DNA polymerase. It has been reported that 3'-0-methoxy-deoxynucleotides are good substrates for several polymerases 
(Axelrod et al. 1978). 3'-0-allyl-dATP was also shown to be incorporated by Ventr(exo-) DNA polymerase in the growing 
strand of DNA (Metzker et al. 1994). However, the procedure to chemically cleave the methoxy group is stringent and 
requires anhydrous conditions. Thus, it is not practical to use a methoxy group to cap the 3'-OH group for sequencing 
DNA by synthesis. An ester group was also explored to cap the 3'-OH group of the nucleotide, but it was shown to be 
2Q cleaved by the nucleophiles in the active site in DNA polymerase (Canard et al. 1995). Chemical groups with electrophiles 
such as ketone groups are not suitable for protecting the 3' -OH of the nucleotide in enzymatic reactions due to the 
existence of strong nucleophiles in the polymerase. It is known that MOM (-CH 2 OCH 3 ) and allyl (-CH 2 CH=CH 2 ) groups 
can be used to cap an -OH group, and can be cleaved chemically with high yield (Ireland et al. 1986; Kamal et al. 1999). 
The approach disclosed in the present application is to incorporate nucleotide analogues, which are labeled with cleav- 
es able, unique labels such as fluorescent dyes or mass tags and where the 3-OH is capped with a cleavable chemical 
moiety such as either a MOM group (-CH 2 OCH 3 ) or an allyl group (-CH 2 CH=CH 2 ), into the growing strand DNA as 
terminators. The optimized nucleotide set ( 3 '.ro- a -|_abeli . 3'-ro _c -label2' 3'-ro- g -label3» 3'-ro- t -label4» wnere R de- 
notes the chemical group used to cap the 3-OH) can then be used for DNA sequencing by the synthesis approach. 
[0009] There are many advantages of using mass spectrometry (MS) to detect small and stable molecules. For 
30 example, the mass resolution can be as good as one dalton. Thus, compared to gel electrophoresis sequencing systems 
and the laser induced fluorescence detection approach which have overlapping fluorescence emission spectra, leading 
to heterozygote detection difficulty, the MS approach disclosed in this application produces very high resolution of 
sequencing data by detecting the cleaved small mass tags instead of the long DNA fragment. This method also produces 
extremely fast separation in the time scale of microseconds. The high resolution allows accurate digital mutation and 
35 heterozygote detection. Another advantage of sequencing with mass spectrometry by detecting the small mass tags is 
that the compressions associated with gel based systems are completely eliminated. 

[0010] In order to maintain a continuous hybridized primer extension product with the template DNA, a primer that 
contains a stable loop to form an entity capable of self-priming in a polymerase reaction can be ligated to the 3' end of 
each single stranded DNA template that is immobilized on a solid surface such as a chip. This approach will solve the 

40 problem of washing off the growing extension products in each cycle. 

[001 1] Saxon and Bertozzi (2000) developed an elegant and highly specific coupling chemistry linking a specific group 
that contains a phosphine moiety to an azido group on the surface of a biological cell. In the present application, this 
coupling chemistry is adopted to create a solid surface which is coated with a covalently linked phosphine moiety, and 
to generate polymerase chain reaction (PCR) products that contain an azido group at the 5' end for specific coupling of 

45 the DNA template with the solid surface. One example of a solid surface is glass channels which have an inner wall with 
an uneven or porous surface to increase the surface area. Another example is a chip. 

[0012] The present application discloses a novel and advantageous system for DNA sequencing by the synthesis 
approach which employs a stable DNA template, which is able to self prime for the polymerase reaction, covalently 
linked to a solid surface such as a chip, and 4 unique nucleotides analogues 
50 (3'.ro" a -labeli» 3-ro- c -|_abel2' 3'-ro- g -label3» 3'-ro- t -label4)- The success of this novel system will allow the devel- 
opment of an ultra high-throughput and high fidelity DNA sequencing system for polymorphism, pharmacogenetics 
applications and for whole genome sequencing. This fast and accurate DNA resequencing system is needed in such 
fields as detection of single nucleotide polymorphisms (SNPs) (Chee et al. 1996), serial analysis of gene expression 
(SAGE) (Velculescu et al. 1995), identification in forensics, and genetic disease association studies. 

55 

Summary Of The Invention 

[001 3] This invention is directed to a method for sequencing a nucleic acid by sequentially determining the identity of 
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a nucleotide analogue after the nucleotide analogue is incorporated into a growing strand of DNA in a polymerase 
reaction, which comprises the following steps: 

(i) attaching a 5' end of the nucleic acid to a solid surface; 

5 

(ii) attaching a primer to the nucleic acid attached to the solid surface; 

(iii) adding a polymerase and one or more different nucleotide analogues to the nucleic acid to thereby incorporate 
a nucleotide analogue into the growing strand of DNA, wherein the incorporated nucleotide analogue terminates 

10 the polymerase reaction and wherein each different nucleotide analogue comprises (a) a base selected from the 

group consisting of adenine, guanine, cytosine, thymine, and uracil, and their analogues; (b) a unique label attached 
through a cleavable linker to the base or to an analogue of the base; (c) a deoxyribose; and (d) a cleavable chemical 
group to cap an -OH group at a 3-position of the deoxyribose, wherein the cleavable chemical group is -CH 2 OCH 3 
or-CH 2 CH=CH 2 ; 

15 

(iv) washing the solid surface to remove unincorporated nucleotide analogues; 

(v) determining the identity of the unique label attached to the nucleotide analogue that has been incorporated into 
the growing strand of DNA, so as to thereby identify the incorporated nucleotide analogue; 

20 

(vi) adding one or more chemical compounds to permanently cap any unreacted -OH group on the primer attached 
to the nucleic acid or on a primer extension strand formed by adding one or more nucleotides or nucleotide analogues 
to the primer; 

25 (vii) cleaving the cleavable linker between the nucleotide analogue that was incorporated into the growing strand of 

DNA and the unique label; 

(viii) cleaving the cleavable chemical group capping the -OH group at the 3* -position of the deoxyribose to uncap 
the -OH group, and washing the solid surface to remove cleaved compounds; and 

30 

(ix) repeating steps (iii) through (viii) so as to determine, for each repetition, the identity of the newly incorporated 
nucleotide analogue into the growing strand of DNA; 

wherein if the unique label is a dye, the order of steps (v) through (vii) is: (v), (vi), and (vii); and 
35 wherein if the unique label is a mass tag, the order of steps (v) through (vii) is: (vi), (vii), and (v). 

[0014] The invention provides a method of attaching a nucleic acid to a solid surface which comprises: 

(i) coating the solid surface with a phosphine moiety, 

to (jj) attaching an azido group to a 5' end of the nucleic acid, and 

(iii) immobilizing the 5' end of the nucleic acid to the solid surface through interaction between the phosphine moiety 
on the solid surface and the azido group on the 5' end of the nucleic acid. 

45 [0015] The invention provides a nucleotide analogue which comprises: 

(a) a base selected from the group consisting of adenine or an analogue of adenine, cytosine or an analogue of 
cytosine, guanine or an analogue of guanine, thymine or an analogue of thymine, and uracil or an analogue of uracil; 

so (b) a unique label attached through a cleavable linker to the base or to an analogue of the base; 

(c) a deoxyribose; and 

(d) a cleavable chemical group to cap an -OH group at a 3'-position of the deoxyribose, wherein the cleavable 
55 chemical group is -CH 2 OCH 3 or -CH 2 CH=CH 2 . 
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Brief Description Of The Figures 
[0016] 

5 Figure 1 : The 3D structure of the ternary complexes of rat DNA polymerase, a DNA template- primer, and dideox- 

ycytidine triphosphate (ddCTP). The left side of the illustration shows the mechanism for the addition of ddCTP and 
the right side of the illustration shows the active site of the polymerase. Note that the 3' position of the dideoxyribose 
ring is very crowded, while ample space is available at the 5 position of the cytidine base. 

10 Figure 2A-2B: Scheme of sequencing by the synthesis approach. A: Example where the unique labels are dyes 

and the solid surface is a chip. B: Example where the unique labels are mass tags and the solid surface is channels 
etched into a glass chip. A, C, G, T; nucleotide triphosphates comprising bases adenine, cytosine, guanine, and 
thymine; d, deoxy; dd, dideoxy; R, cleavable chemical group used to cap the -OH group; Y, cleavable linker. 

15 Figure 3: The synthetic scheme for the immobilization of an azido (N 3 ) labeled DNA fragment to a solid surface 

coated with a triarylphosphine moiety. Me, methyl group; P, phosphorus; Ph, phenyl. 

Figure 4: The synthesis of triarylphosphine N-hydroxysuccinimide (NHS) ester. 

20 Figure 5: The synthetic scheme for attaching an azido (N 3 ) group through a linker to the 5' end of a DNA fragment, 

which is then used to couple with the triarylphosphine moiety on a solid surface. DMSO, dimethylsulfonyl oxide. 

Figure 6A-6B: Ligate the looped primer (B) to the immobilized single stranded DNA template forming a self primed 
DNA template moiety on a solid surface. P (in circle), phosphate. 

25 

Figure 7: Examples of structures of four nucleotide analogues for use in the sequencing by synthesis approach. 
Each nucleotide analogue has a unique fluorescent dye attached to the base through a photocieavabie linker and 
the 3-OH is either exposed or capped with a MOM group or an allyl group. FAM, 5-carboxyfluorescein; R6G, 6- 
carboxyrhodamine-6G; TAM, N,N,N',N'-tetramethyl-6-carboxyrhodamine; ROX, 6-carboxy-X-rhodamine. R = H, 
30 CH 2 OCH 3 (MOM) or CH 2 CH=CH 2 (Allyl). 

Figure 8: A representative scheme for the synthesis of the nucleotide analogue 3'.Rcr G -Tanv A similar scheme can 
be used to create the other three modified nucleotides: 3'.Ro-A- 0y ei. 3'-RO"C~Dye2' 3'-RO'T-Dye4- (') tetrakis(triphenyl- 
phosphine)palladium(O); (ii) POCI 3 , Bn 4 N + pyrophosphate; (iii) NH 4 OH; (iv) Na 2 C0 3 /NaHC0 3 (pH = 9.0J/DMSO. 

35 

Figure 9: A scheme for testing the sequencing by synthesis approach. Each nucleotide, modified by the attachment 
of a unique fluorescent dye, is added one by one, based on the complimentary template. The dye is detected and 
cleaved to test the approach. Dye1 = Fam; Dye2 = R6G; Dye3 = Tarn; Dye4 = Rox. 

40 Figure 1 0: The expected photocleavage products of DNA containing a photo-cleavable dye (Tarn). Light absorption 

(300 - 360 nm) by the aromatic 2-nitrobenzyl moiety causes reduction of the 2-nitro group to a nitroso group and 
an oxygen insertion into the carbon-hydrogen bond located in the 2-position followed by cleavage and decarboxylation 
(Pillai 1980). 

45 Figure 11: Synthesis of PC-LC-Biotin-FAM to evaluate the photolysis efficiency of the fluorophore coupled with 

the photocieavabie linker 2-nitrobenzyl group. 

Figure 12: Fluorescence spectra (X ex = 480 nm) of PC-LC-Biotin-FAM immobilized on a microscope glass slide 
coated with streptavidin (a); after 10 min photolysis (\ r = 350 nm; -0.5 mW/cm 2 ) (b); and after washing with water 
50 to remove the photocleaved dye (c). 

Figure 13A-13B: Synthetic scheme for capping the 3-OH of nucleotide. 

Figure 14: Chemical cleavage of the MOM group (top row) and the allyl group (bottom row) to free the 3-OH in the 
55 nucleotide. CITMS = chlorotrimethylsilane. 

Figure 15A-15B: Examples of energy transfer coupled dye systems, where Fam or Cy2 is employed as a light 
absorber (energy transfer donor) and CI 2 Fam, CI 2 R6G r CI 2 Tam, or CI 2 Rox as an energy transfer acceptor. Cy2, 
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cyanine; FAM, 5-carboxyfluorescein; R6G, 6-carboxyrhodamine-6G; TAM, N,N,N\NMetramethyl-6-carfcoxyrhod- 
amine; ROX, 6-carboxy-X-rhod amine. 

Figure 16: The synthesis of a photocleavable energy transfer dye-labeled nucleotide. DMF, dimethylformide. DEC 
5 = 1-(3-dimethylaminopropyl)-3-ethylcarbodimide hydrochloride. R = H, CH 2 OCH 3 (MOM) or CH 2 CH=CH 2 (Ally!). 

Figure 17: Structures of four mass tag precursors and four photoactive mass tags. Precursors: a) acetophenone; 
b) 3-fluoroacetophenone; c) 3,4-difluoroacetophenone; and d) 3,4-dimethoxyacetophenone. Four photoactive mass 
tags are used to code for the identity of each of the four nucleotides (A, C, G, T). 

10 

Figure 18: Atmospheric Pressure Chemical Ionization (APCI) mass spectrum of mass tag precursors shown in 
Figure 17. 

Figure 19: Examples of structures of four nucleotide analogues for use in the sequencing by synthesis approach. 
15 Each nucleotide analogue has a unique mass tag attached to the base through a photocleavable linker, and the 3'- 

OH is either exposed or capped with a MOM group or an allyl group. The square brackets indicated that the mass 
tag is cleavable. R = H, CH 2 OCH 3 (MOM) or CH 2 CH=CH 2 (Allyl). 

Figure 20: Example of synthesis of NHS ester of one mass tag (Tag-3). A similar scheme is used to create other 
20 mass tags. 

Figure 21 : A representative scheme for the synthesis of the nucleotide analogue 3 '.Ro-G-Tag3* A similar scheme is 
used to create the other three modified bases s-.Ro-A-Tagi • 3'-RO'C"Tag2' 3'-RO" T Tag4- (0 tetrakis(triphenylphosphine) 
palladium(O); (ii) POCI 3 , Bn 4 N + pyrophosphate; (iii) NH 4 OH; (iv) Na 2 C0 3 /NaHC0 3 (pH = 9.0)/DMSO. 

25 

Figure 22: Examples of expected photocleavage products of DNA containing a photocleavable mass tag. 
Detailed Description Of The Invention 

30 [0017] The following definitions are presented as an aid in understanding this invention. 

[0018] As used herein, to cap an -OH group means to replace the "IT in the -OH group with a chemical group. As 
disclosed herein, the -OH group of the nucleotide analogue is capped with a cleavable chemical group. To uncap an 
-OH group means to cleave the chemical group from a capped -OH group and to replace the chemical group with "H", 
i.e., to replace the "R" in -OR with "H" wherein "R" is the chemical group used to cap the -OH group. 

35 [0019] The nucleotide bases are abbreviated as follows: adenine (A), cytosine (C), guanine (G), thymine (T), and 
uracil (U). 

[0020] An analogue of a nucleotide base refers to a structural and functional derivative of the base of a nucleotide 
which can be recognized by polymerase as a substrate. That is, for example, an analogue of adenine (A) should form 
hydrogen bonds with thymine (T), a C analogue should form hydrogen bonds with G, a G analogue should form hydrogen 
40 bonds with C, and a T analogue should form hydrogen bonds with A, in a double helix format. Examples of analogues 
of nucleotide bases include, but are not limited to, 7-deaza-adenine and 7-deaza-guanine, wherein the nitrogen atom 
at the 7-position of adenine or guanine is substituted with a carbon atom. 

[0021] A nucleotide analogue refers to a chemical compound that is structurally and functionally similar to the nu- 
cleotide, i.e. the nucleotide analogue can be recognized by polymerase as a substrate. That is, for example, a nucleotide 

45 analogue comprising adenine or an analogue of adenine should form hydrogen bonds with thymine, a nucleotide analogue 
comprising C or an analogue of C should form hydrogen bonds with G, a nucleotide analogue comprising G or an 
analogue of G should form hydrogen bonds with C, and a nucleotide analogue comprising T or an analogue of T should 
form hydrogen bonds with A, in a double helix format. Examples of nucleotide analogues disclosed herein include 
analogues which comprise an analogue of the nucleotide base such as 7-deaza-adenine or 7-deaza-guanine, wherein 

50 the nitrogen atom at the 7-position of adenine or guanine is substituted with a carbon atom. Further examples include 
analogues in which a label is attached through a cleavable linker to the 5-position of cytosine or thymine or to the 7- 
position of deaza-adenine or deaza-guanine. Other examples include analogues in which a small chemical moiety such 
as - CH 2 OCH 3 or -CH 2 CH=CH 2 is used to cap the -OH group at the 3'-position of deoxyribose. Analogues of dideoxy- 
nucleotides can similarly be prepared. 

55 [0022] As used herein, a porous surface is a surface which contains pores or is otherwise uneven, such that the 
surface area of the porous surface is increased relative to the surface area when the surface is smooth. 
[0023] The present invention is directed to a method for sequencing a nucleic acid by sequentially determining the 
identity of a nucleotide analogue after the nucleotide analogue is incorporated into a growing strand of DNA in a polymer- 
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ase reaction, which comprises the following steps: 

(i) attaching a 5' end of the nucleic acid to a solid surface; 

5 (ii) attaching a primer to the nucleic acid attached to the solid surface; 

(iii) adding a polymerase and one or more different nucleotide analogues to the nucleic acid to thereby incorporate 
a nucleotide analogue into the growing strand of DNA, wherein the incorporated nucleotide analogue terminates 
the polymerase reaction and wherein each different nucleotide analogue comprises (a) a base selected from the 
10 group consisting of adenine, guanine, cytosine, thymine, and uracil, and their analogues; (b) a unique label attached 

through a cleavable linker to the base or to an analogue of the base; (c) a deoxyribose; and (d) a cleavable chemical 
group to cap an -OH group at a 3'-position of the deoxyribose, wherein the cleavable chemical group is -CH 2 OCH 3 
or-CH 2 CH=CH 2 ; 

15 (iv) washing the solid surface to remove unincorporated nucleotide analogues; 

(v) determining the identity of the unique label attached to the nucleotide analogue that has been incorporated into 
the growing strand of DNA, so as to thereby identify the incorporated nucleotide analogue; 

20 (vi) adding one or more chemical compounds to permanently cap any unreacted -OH group on the primer attached 

to the nucleic acid or on a primer extension strand formed by adding one or more nucleotides or nucleotide analogues 
to the primer; 

(vii) cleaving the cleavable linker between the nucleotide analogue that was incorporated into the growing strand of 
25 DNA and the unique label; 

(viii) cleaving the cleavable chemical group capping the -OH group at the 3'-position of the deoxyribose to uncap 
the -OH group, and washing the solid surface to remove cleaved compounds; and 

so (jx) repeating steps (iii) through (viii) so as to determine, for each repetition, the identity of the newly incorporated 

nucleotide analogue into the growing strand of DNA; 

wherein if the unique label is a dye, the order of steps (v) through (vii) is: (v), (vi), and (vii); and 
wherein if the unique label is a mass tag, the order of steps (v) through (vii) is: (vi), (vii), and (v). 

35 [0024] In one embodiment of any of the nucleotide analogues described herein, the nucleotide base is adenine. In 
one embodiment, the nucleotide base is guanine. In one embodiment, the nucleotide base is cytosine. In one embodiment, 
the nucleotide base is thymine. In one embodiment, the nucleotide base is uracil. In one embodiment, the nucleotide 
base is an analogue of adenine. In one embodiment, the nucleotide base is an analogue of guanine. In one embodiment, 
the nucleotide base is an analogue of cytosine. In one embodiment, the nucleotide base is an analogue of thymine. In 

*o one embodiment, the nucleotide base is an analogue of uracil. 

[0025] In different embodiments of any of the inventions described herein, the solid surface is glass, silicon, or gold. 
In different embodiments, the solid surface is a magnetic bead, a chip, a channel in a chip, or a porous channel in a 
chip. In one embodiment, the solid surface is glass. In one embodiment, the solid surface is silicon. In one embodiment, 
the solid surface is gold. In one embodiments, the solid surface is a magnetic bead. In one embodiment, the solid surface 

45 is a chip. In one embodiment, the solid surface is a channel in a chip. In one embodiment, the solid surface is a porous 
channel in a chip. Other materials can also be used as long as the material does not interfere with the steps of the method. 
[0026] In one embodiment, the step of attaching the nucleic acid to the solid surface comprises: 

(i) coating the solid surface with a phosphine moiety, 

so 

(ii) attaching an azido group to the 5' end of the nucleic acid, and 

(iii) immobilizing the 5' end of the nucleic acid to the solid surface through interaction between the phosphine moiety 
on the solid surface and the azido group on the 5' end of the nucleic acid. 

55 

[0027] In one embodiment, the step of coating the solid surface with the phosphine moiety comprises: 
(i) coating the surface with a primary amine, and 
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(ii) covalently coupling a N-hydroxysuccinimidyl ester of triarylphosphine with the primary amine. 

[0028] In one embodiment, the nucleic acid that is attached to the solid surface is a single-stranded deoxyribonucleic 
acid (DNA). In another embodiment, the nucleic acid that is attached to the solid surface in step (i) is a double-stranded 

5 DNA, wherein only one strand is directly attached to the solid surface, and wherein the strand that is not directly attached 
to the solid surface is removed by denaturing before proceeding to step (ii). In one embodiment, the nucleic acid that is 
attached to the solid surface is a ribonucleic acid (RNA), and the polymerase in step (iii) is reverse transcriptase. 
[0029] In one embodiment, the primer is attached to a 3' end of the nucleic acid in step (ii), and the attached primer 
comprises a stable loop and an -OH group at a 3-position of a deoxyribose capable of self-priming in the polymerase 

10 reaction. In one embodiment, the step of attaching the primer to the nucleic acid comprises hybridizing the primer to the 
nucleic acid or ligating the primer to the nucleic acid. In one embodiment, the primer is attached to the nucleic acid 
through a ligation reaction which links the 3' end of the nucleic acid with the 5' end of the primer. 
[0030] In one embodiment, one or more of four different nucleotide analogs is added in step (iii), wherein each different 
nucleotide analogue comprises a different base selected from the group consisting of thymine or uracil or an analogue 

15 of thymine or uracil, adenine or an analogue of adenine, cytosine or an analogue of cytosine, and guanine or an analogue 
of guanine, and wherein each of the four different nucleotide analogues comprises a unique label. 
[0031] The cleavable chemical group that caps the -OH group at the 3 '-position of the deoxyribose in the nucleotide 
analogue is -CH 2 OCH 3 or -CH 2 CH=CH 2 . Any one of these chemical groups 1) is stable during the polymerase reaction, 
2) does not interfere with the recognition of the nucleotide analogue by polymerase as a substrate, and 3) is cleavable. 

20 [0032] In one embodiment, the unique label that is attached to the nucleotide analogue is a fluorescent moiety or a 
fluorescent semiconductor crystal. In further embodiments, the fluorescent moiety is selected from the group consisting 
of 5-carboxyfluorescein, 6-carboxyrhodamine-6G, N,N,N\NMetramethyl-6-cart>oxyrhodamine, and 6-carboxy-X-rhod- 
amine. In one embodiment, the fluorescent moiety is 5-carboxyfluorescein. In one embodiment, the fluorescent moiety 
is 6-carboxyrhodamtne-6G, N,N,N',N'-tetramethyl-6-carboxyrhodamine. In one embodiment, the fluorescent moiety is 

25 6-carboxy-X-rhodamine. 

[0033] In one embodiment, the unique label that is attached to the nucleotide analogue is a fluorescence energy 
transfer tag which comprises an energy transfer donor and an energy transfer acceptor. In further embodiments, the 
energy transfer donor is 5-carboxyfluorescein or cyanine, and wherein the energy transfer acceptor is selected from the 
group consisting of dichlorocarboxyfluorescein, dichloro-6-carboxyrhodamine-6G, dichloro-N.N.N'.N-tetramethyl-S-car- 

30 boxyrhodamine, and dichloro-6-carboxy-X-rhodamine. In one embodiment, the energy transfer acceptor is dichlorocar- 
boxyfluorescein. In one embodiment, the energy transfer acceptor is dichloro-6-carboxyrhodamine-6G. In one embod- 
iment, the energy transfer acceptor is dichloro-N.N.N'.N'-tetramethyl-e-carboxyrhodamine. In one embodiment, the en- 
ergy transfer acceptor is dichloro-6-carboxy-X-rhodamine. 

[0034] In one embodiment, the unique label that is attached to the nucleotide analogue is a mass tag that can be 
35 detected and differentiated by a mass spectrometer. In further embodiments, the mass tag is selected from the group 
consisting of a 2-nitro-a-methyl-benzyl group, a 2-nitro-a-methyl-3-fluorobenzyl group, a 2-nitro-a-methyl-3,4-difluor- 
obenzyl group, and a 2-nitro-a-methyl-3,4-dimethoxybenzyl group. In one embodiment, the mass tag is a 2-nitro-a- 
methyl-benzyl group. In one embodiment, the mass tag is a 2-nitro-a-methyl-3-fluorobenzyl group. In one embodiment, 
the mass tag is a 2-nitro-a-methyl-3,4-difluorobenzyl group. In one embodiment, the mass tag is a 2-nitro-a-methyl-3,4- 
40 dimethoxybenzyl group. In one embodiment, the mass tag is detected using a parallel mass spectrometry system which 
comprises a plurality of atmospheric pressure chemical ionization mass spectrometers for parallel analysis of a plurality 
of samples comprising mass tags. 

[0035] In one embodiment, the unique label is attached through a cleavable linker to a 5-position of cytosine or thymine 
or to a 7-position of deaza-adenine or deaza-guanine. The unique label could also be attached through a cleavable 
45 linker to another position in the nucleotide analogue as long as the attachment of the label is stable during the polymerase 
reaction and the nucleotide analog can be recognized by polymerase as a substrate. For example, the cleavable label 
could be attached to the deoxyribose. 

[0036] In one embodiment, the linker between the unique label and the nucleotide analogue is cleaved by a means 
selected from the group consisting of one or more of a physical means, a chemical means, a physical chemical means, 
50 heat, and light. In one embodiment, the linker is cleaved by a physical means. In one embodiment, the linker is cleaved 
by a chemical means. In one embodiment, the linker is cleaved by a physical chemical means. In one embodiment, the 
linker is cleaved by heat. In one embodiment, the linker is cleaved by light. In one embodiment, the linker is cleaved by 
ultraviolet light. In a further embodiment, the cleavable linker is a photocleavable linker which comprises a 2-nitrobenzyl 
moiety. 

55 [0037] In one embodiment, the cleavable chemical group used to cap the -OH group at the 3'-position of the deoxyribose 
is cleaved by a means selected from the group consisting of one or more of a physical means, a chemical means, a 
physical chemical means, heat, and light. In one embodiment, the linker is cleaved by a physical chemical means. In 
one embodiment, the linker is cleaved by heat. In one embodiment, the linker is cleaved by light. In one embodiment, 
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the linker is cleaved by ultraviolet light. 

[0038] In one embodiment, the chemical compounds added in step (vi) to permanently cap any unreacted -OH group 
on the primer attached to the nucleic acid or on the primer extension strand are a polymerase and one or more different 
dideoxynucleotides or analogues of dideoxynucleotides. In further embodiments, the different dideoxynucleotides are 

5 selected from the group consisting of 2\3'-dideoxyadenosine 5-triphosphate, 2\3'-dideoxyguanosine 5'-tri phosphate, 
2',3'-dideoxycytidine 5'-triphosphate, 2\3'-dideoxythymidine 5-triphosphate, 2',3'-dideoxyuridine 5'-triphosphase, and 
their analogues. In one embodiment, the dideoxynucleotide is 2\3'-dideoxyadenosine 5-triphosphate. In one embodi- 
ment, the dideoxynucleotide Is 2',3'-dideoxyguanosine 5-triphosphate. In one embodiment, the dideoxynucleotide is 2', 
3'-dideoxycytidine 5'-triphosphate. In one embodiment, the dideoxynucleotide is 2',3-dideoxythymidine 5-triphosphate. 

10 in one embodiment, the dideoxynucleotide is 2',3-dideoxyuridine 5'-triphosphase. In one embodiment, the dideoxynu- 
cleotide is an analogue of 2',3'-dideoxyadenosine 5'-triphosphate. In one embodiment, the dideoxynucleotide is an 
analogue of 2\3'-dideoxyguanosine 5-triphosphate. In one embodiment, the dideoxynucleotide is an analogue of 2\3'- 
dideoxycytidine 5'-triphosphate. In one embodiment, the dideoxynucleotide is an analogue of 2',3'-dideoxythymidine 5'- 
triphosphate. In one embodiment, the dideoxynucleotide is an analogue of 2',3'-dideoxyuridine 5'-triphosphase. 

15 [0039] In one embodiment, a polymerase and one or more of four different dideoxynucleotides are added in step (vi), 
wherein each different dideoxynucleotide is selected from the group consisting of 2',3-dideoxyadenosine 5-triphosphate 
or an analogue of 2',3-dideoxyadenosine 5-triphosphate; 2',3-dideoxyguanosine 5-triphosphate or an analogue of 2', 
3'-dideoxyguanosine 5'-triphosphate; 2',3'-dideoxycytidine 5-triphosphate or an analogue of 2\3'-dideoxycytidine 5'- 
triphosphate; and 2',3-dideoxythymidine 5'-triphosphate or 2',3'-dideoxyuridine 5'-triphosphase or an analogue of 2,3- 

20 dideoxythymidine ^-triphosphate or an analogue of 2',3'-dideoxyuridine 5-triphosphase. In one embodiment, the dide- 
oxynucleotide is 2\3'-dideoxyadenosine 5'-tri phosphate. In one embodiment, the dideoxynucleotide is an analogue of 
2',3-dideoxyadenosine 5'-triphosphate. In one embodiment, the dideoxynucleotide is 2',3'-dideoxyguanosine 5-triphos- 
phate. In one embodiment, the dideoxynucleotide is an analogue of 2',3'-dideoxyguanosine 5'-triphosphate. In one 
embodiment, the dideoxynucleotide is 2\3'-dideoxycytidine 5-triphosphate. In one embodiment, the dideoxynucleotide 

25 js an analogue of 2\3'-dideoxycytidine ^-triphosphate. In one embodiment, the dideoxynucleotide is 2',3-dideoxythy- 
midine 5'-triphosphate. In one embodiment, the dideoxynucleotide is 2',3'-dideoxyuridine 5-triphosphase. In one em- 
bodiment, the dideoxynucleotide is an analogue of 2',3'-dideoxythymidine 5-triphosphate. In one embodiment, the 
dideoxynucleotide is an analogue of 2',3'-dideoxyuridine 5-triphosphase. 

[0040] Another type of chemical compound that reacts specifically with the -OH group could also be used to permanently 
30 cap any unreacted -OH group on the primer attached to the nucleic acid or on an extension strand formed by adding 
one or more nucleotides or nucleotide analogues to the primer. 

[0041] The invention provides a method for simultaneously sequencing a plurality of different nucleic acids, which 
comprises simultaneously applying any of the methods disclosed herein for sequencing a nucleic acid to the plurality of 
different nucleic acids. In different embodiments, the method can be used to sequence from one to over 1 00,000 different 
35 nucleic acids simultaneously. 

[0042] The invention provides for the use of any of the methods disclosed herein for detection of single nucleotide 
polymorphisms, genetic mutation analysis, serial analysis of gene expression, gene expression analysis, identification 
in forensics, genetic disease association studies, DNA sequencing, genomic sequencing, translation^ analysis, or tran- 
scriptional analysis. 

40 [0043] The invention provides a method of attaching a nucleic acid to a solid surface which comprises: 



(i) coating the solid surface with a phosphine moiety, 



45 



50 



(ii) attaching an azido group to a 5' end of the nucleic acid, and 

(iii) immobilizing the 5' end of the nucleic acid to the solid surface through interaction between the phosphine moiety 
on the solid surface and the azido group on the 5' end of the nucleic acid. 

[0044] In one embodiment, the step of coating the solid surface with the phosphine moiety comprises: 

(i) coating the surface with a primary amine, and 

(ii) covalently coupling a N-hydroxysuccinimidyl ester of triarylphosphine with the primary amine. 



55 [0045] In different embodiments, the solid surface is glass, silicon, or gold. In different embodiments, the solid surface 
is a magnetic bead, a chip, a channel in an chip, or a porous channel in a chip. 

[0046] In different embodiments, the nucleic acid that is attached to the solid surface is a single-stranded or double- 
stranded DNA or a RNA. In one embodiment, the nucleic acid is a double-stranded DNA and only one strand is attached 



9 



1 



EP 1 337 541 B1 



to the solid surface. In a further embodiment, the strand of the double-stranded DNA that is not attached to the solid 
surface is removed by denaturing. 

[0047] The invention provides for the use of any of the methods disclosed herein for attaching a nucleic acid to a 
surface for gene expression analysis, microarray based gene expression analysis, or mutation detection, translational 
5 analysis, transcriptional analysis, or for other genetic applications. 

[0048] The invention provides a nucleotide analogue which comprises: 

(a) a base selected from the group consisting of adenine or an analogue of adenine, cytosine or an analogue of 
cytosine, guanine or an analogue of guanine, thymine or an analogue of thymine, and uracil or an analogue of uracil; 

10 

(b) a unique label attached through a cleavable linker to the base or to an analogue of the base; 

(c) a deoxyribose; and 

15 (d) a cleavable chemical group to cap an -OH group at a 3'-position of the deoxyribose, wherein the cleavable 

chemical group is -CH 2 OCH 3 or -CH 2 CH=CH 2 . 

[0049] In one embodiment of the nucleotide analogue, the cleavable chemical group that caps the -OH group at the 
3'-position of the deoxyribose is -CH 2 CH=CH 2 . 
20 [0050] In one embodiment, the unique label is a fluorescent moiety or a fluorescent semiconductor crystal. In further 
embodiments, the fluorescent moiety is selected from the group consisting of 5-carboxyfluorescein, 6-carboxyrhodamine- 
6G, N,N,N\NMetramethyl-6-carboxyrhodamine, and 6-carboxy-X-rhodamine. 

[0051 ] In one embodiment, the unique label is a fluorescence energy transfer tag which comprises an energy transfer 
donor and an energy transfer acceptor. In further embodiments, the energy transfer donor is 5-carboxyfluorescein or 
25 cyanine, and wherein the energy transfer acceptor is selected from the group consisting of dichlorocarboxyfluorescein, 
dichloro-6-carboxyrhodamine-6G, dichloro-N.N^'.N'-tetramethyl-e-carboxyrhodamine, and dichloro-6-carboxy-X-rhod- 
amine. 

[0052] In one embodiment, the unique label is a mass tag that can be detected and differentiated by a mass spec- 
trometer. In further embodiments, the mass tag is selected from the group consisting of a 2-nitro-a-methyl-benzyl group, 
30 a 2-nitro-a-methyl-3-fluorobenzyl group, a 2-nitro-a-methyl-3,4-difluorobenzyl group, and a 2-nitro-o>methyl-3,4-dimeth- 
oxybenzyl group. 

[0053] In one embodiment, the unique label is attached through a cleavable linker to a 5-position of cytosine or thymine 
or to a 7-position of deaza-adenine or deaza-guanine. The unique label could also be attached through a cleavable 
linker to another position in the nucleotide analogue as long as the attachment of the label is stable during the polymerase 
35 reaction and the nucleotide analog can be recognized by polymerase as a substrate. For example, the cleavable label 
could be attached to the deoxyribose. 

[0054] In one embodiment, the linker between the unique label and the nucleotide analogue is cleavable by a means 
selected from the group consisting of one or more of a physical means, a chemical means, a physical chemical means, 
heat, and light. In a further embodiment, the cleavable linker is a photocleavable linker which comprises a 2-nitrobenzyl 
^o moiety. 

[0055] In one embodiment, the cleavable chemical group used to cap the -OH group at the 3-position of the deoxyribose 
is cleavable by a means selected from the group consisting of one or more of a physical means, a chemical means, a 
physical chemical means, heat, and light. 

[0056] In different embodiments, the nucleotide analogue is selected from the group consisting of: 

45 



50 
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wherein Dye 1 , Dye 2 , Dye 3 , and Dye 4 are four different unique labels; and 
wherein R is -CH 2 OCH 3 or -CH 2 CH=CH 2 . 

[0057] In different embodiments, the nucleotide analogue is selected from the group consisting of: 




OR 
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and 




wherein Tag.,, Tag 2 , Tag 3> and Tag 4 are four different mass tag labels; and 
wherein R is -CH 2 OCH 3 or -CH 2 CH=CH 2 . 

[0059] In different embodiments, the nucleotide analogue is selected from the group consisting of: 




OR 
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wherein R is -CH 2 OCH 3 or -CH 2 CH=CH 2 . 

[0060] The invention provides for the use any of the nucleotide analogues disclosed herein for detection of single 
nucleotide polymorphisms, genetic mutation analysis, serial analysis of gene expression, gene expression analysis, 
identification in forensics, genetic disease association studies, DNA sequencing, genomic sequencing, translational 
is analysis, or transcriptional analysis. 

Experimental Details 

1. The Sequencing by Synthesis Approach 

20 

[0061 J Sequencing DNA by synthesis involves the detection of the identity of each nucleotide as it is incorporated into 
the growing strand of DNA in the polymerase reaction. The fundamental requirements for such a system to work are: 
(1) the availability of 4 nucleotide analogues (aA, aC, aQ, aT) each labeled with a unique label and containing a chemical 
moiety capping the 3'-OH group; (2) the 4 nucleotide analogues (aA, aC, aG, aT) need to be efficiently and faithfully 

25 incorporated by DNA polymerase as terminators in the polymerase reaction; (3) the tag and the group capping the 3- 
OH need to be removed with high yield to allow the incorporation and detection of the next nucleotide; and (4) the growing 
strand of DNA should survive the washing, detection and cleavage processes to remain annealed to the DNA template. 
[0062] The sequencing by synthesis approach disclosed herein is illustrated in Figure 2A-2B. In Figure 2A, an 
example is shown where the unique labels are fluorescent dyes and the surface is a chip; in Figure 2B, the unique 

30 labels are mass tags and the surface is channels etched into a chip. The synthesis approach uses a solid surface such 
as a glass chip with an immobilized DNA template that is able to self prime for initiating the polymerase reaction, and 
four nucleotide analogues ( 3 '-ro- a "labeli. 3'-ro- c -labei_2> 3'-ro- g -labei_3« 3'-ro- t -label4> each labeled with a unique 
label, e.g. a fluorescent dye or a mass tag, at a specific location on the purine or pyrimidine base, and a small cleavable 
chemicaf group (R) to cap the 3-OH group. Upon adding the four nucleotide analogues and DNA polymerase, only one 

35 nucleotide analogue that is complementary to the next nucleotide on the template is incorporated by the polymerase on 
each spot of the surface (step 1 in Fig. 2A and 2B). 

[0063] As shown in Figure 2A, where the unique labels are dyes, after removing the excess reagents and washing 
away any unincorporated nucleotide analogues on the chip, a detector is used to detect the unique label. For example, 
a four color fluorescence imager is used to image the surface of the chip, and the unique fluorescence emission from a 

40 specific dye on the nucleotide analogues on each spot of the chip will reveal the identity of the incorporated nucleotide 
(step 2 in Fig. 2A). After imaging, the small amount of unreacted 3'-OH group on the self-primed template moiety is 
capped by excess dideoxynucleoside triphosphates (ddNTPs) (ddATP, ddGTP, ddTTP, and ddCTP) and DNA polymer- 
ase to avoid interference with the next round of synthesis (step 3 in Fig. 2A), a concept similar to the capping step in 
automated solid phase DNA synthesis (Caruthers, 1985). The ddNTPs, which lack a 3-hydroxyl group, are chosen to 

45 cap the unreacted 3-OH of the nucleotide due to their small size compared with the dye-labeled nucleotides, and the 
excellent efficiency with which they are incorporated by DNA polymerase. The dye moiety is then cleaved by light (-350 
nm), and the R group protecting the 3'-OH is removed chemically to generate free 3'-OH group with high yield (step 4 
in Fig. 2A). A washing step is applied to wash away the cleaved dyes and the R group. The self-primed DNA moiety 
on the chip at this stage is ready for the next cycle of the reaction to identify the next nucleotide sequence of the template 

50 DNA (step 5 in Fig 2A). 

[0064] It is a routine procedure now to immobilize high density (>10,000 spots per chip) single stranded DNA on a 
4cm x 1cm glass chip (Schena et al. 1995). Thus, in the DNA sequencing system disclosed herein, more than 10,000 
bases can be identified after each cycle and after 100 cycles, a million base pairs will be generated from one sequencing 
chip. 

55 [0065] Possible DNA polymerases include Thermo Sequenase, Taq FS DNA polymerase, T7 DNA polymerase, and 
Vent (exo-) DNA polymerase. The fluorescence emission from each specific dye can be detected using a fluorimeter 
that is equipped with an accessory to detect fluorescence from a glass slide. For large scale evaluation, a multi-color 
scanning system capable of detecting multiple different fluorescent dyes (500 nm - 700 nm) (GSI Lumonics ScanArray 
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5000 Standard Biochip Scanning System) on a glass slide can be used. 

[0066] An example of the sequencing by synthesis approach using mass tags is shown in Figure 2B, The approach 
uses a solid surface, such as a porous silica glass channels in a chip, with immobilized DNA template that is able to self 
prime for initiating the polymerase reaction, and four nucleotide analogues 

5 (3'-RO _A "Tagi - 3'-RO _c -Tag2' 3'-RO* G -Tag3» 3'-RO" T Tag4) eacn labeled with a unique photocleavable mass tag on the specific 
location of the base, and a small cleavable chemical group (R) to cap the 3'-OH group. Upon adding the four nucleotide 
analogues and DNA polymerase, only one nucleotide analogue that is complementary to the next nucleotide on the 
template is incorporated by polymerase in each channel of the glass chip (step 1 in Fig. 2B). After removing the excess 
reagents and washing away any unincorporated nucleotide analogues on the chip, the small amount of unreacted 3- 

10 OH group on the self-primed template moiety is capped by excess ddNTPs (ddATP, ddGTP, ddTTP and ddCTP) and 
DNA polymerase to avoid interference with the next round of synthesis (step 2 in Fig. 2B). The ddNTPs are chosen to 
cap the unreacted 3'-OH of the nucleotide due to their small size compared with the labeled nucleotides, and their 
excellent efficiency to be incorporated by DNA polymerase. The mass tags are cleaved by irradiation with light (-350 
nm) (step 3 in Fig. 2B) and then detected with a mass spectrometer. The unique mass of each tag yields the identity 

is of the nucleotide in each channel (step 4 in Fig. 2B). The R protecting group is then removed chemically and washed 
away to generate free 3'-OH group with high yield (step 5 in Fig. 2B). The self-primed DNA moiety on the chip at this 
stage is ready for the next cycle of the reaction to identify the next nucleotide sequence of the template DNA (step 6 in 
Fig.2B). 

[0067] Since the development of new ionization techniques such as matrix assisted laser desorption ionization (MALDI ) 

20 and electrospray ionization (ESI), mass spectrometry has become an indispensable tool in many areas of biomedical 
research. Though these ionization methods are suitable for the analysis of bioorganic molecules, such as peptides and 
proteins, improvements in both detection and sample preparation are required for implementation of mass spectrometry 
for DNA sequencing applications. Since the approach disclosed herein uses small and stable mass tags, there is no 
need to detect large DNA sequencing fragments directly and it is not necessary to use MALDI or ESI methods for 

25 detection. Atmospheric pressure chemical ionization (APCI) is an ionization method that uses a gas-phase ion-molecular 
reaction at atmospheric pressure (Dizidic et al. 1975). In this method, samples are introduced by either chromatography 
or flow injection into a pneumatic nebulizer where they are converted into small droplets by a high-speed beam of nitrogen 
gas. When the heated gas and solution arrive at the reaction area, the excess amount of solvent is ionized by corona 
discharge. This ionized mobile phase acts as the ionizing agent toward the samples and yields pseudo molecular (M+H) + 

30 and (M-H)- ions. Due to the corona discharge ionization method, high ionization efficiency is attainable, maintaining 
stable ionization conditions with detection sensitivity lower than femtomole region for small and stable organic compounds. 
However, due to the limited detection of large molecules, ESI and MALDI have replaced APCI for analysis of peptides 
and nucleic acids. Since in the approach disclosed the mass tags to be detected are relatively small and very stable 
organic molecules, the ability to detect large biological molecules gained by using ESI and MALDI is not necessary. 

35 APCI has several advantages over ESI and MALDI because it does not require any tedious sample preparation such 
as desalting or mixing with matrix to prepare crystals on a target plate. In ESI, the sample nature and sample preparation 
conditions (i.e. the existence of buffer or inorganic salts) suppress the ionization efficiency. MALDI requires the addition 
of matrix prior to sample introduction into the mass spectrometer and its speed is often limited by the need to search for 
an ideal irradiation spot to obtain interpretable mass spectra. These limitations are overcome by APCI because the mass 

*o tag solution can be injected directly with no additional sample purification or preparation into the mass spectrometer. 
Since the mass tagged samples are volatile and have small mass numbers, these compounds are easily detectable by 
APCI ionization with high sensitivity. This system can be scaled up into a high throughput operation. 
[0068] Each component of the sequencing by synthesis system is described in more detail below. 

45 2. Construction of a Surface Containing Immobilized Self-primed DNA Moiety 

[0069] The single stranded DNA template immobilized on a surface is prepared according to the scheme shown in 
Figure 3. The surface can be, for example, a glass chip, such as a 4cm x 1cm glass chip, or channels in a glass chip. 
The surface is first treated with 0.5 M NaOH, washed with water, and then coated with high density 3-aminopropyltri- 

50 methoxysilane in aqueous ethanol (Wool ley etal. 1 994) forming a primary amine surface. N-Hydroxy Succinimidyl (NHS) 
ester of triarylphosphine (1) is covalently coupled with the primary amine group converting the amine surface to a novel 
triarylphosphine surface, which specifically reacts with DNA containing an azido group (2) forming a chip with immobilized 
DNA. Since the azido group is only located at the 5' end of the DNA and the coupling reaction is through the unique 
reaction of the triarylphosphine moiety with the azido group in aqueous solution (Saxon and Bertozzi 2000), such a DNA 

55 surface will provide an optimal condition for hybridization. 

[0070] The NHS ester of triarylphosphine (1) is prepared according to the scheme shown in Figure 4. 3-diphenyl- 
phosphino-4-methoxycarbonyl-benzoic acid (3) is prepared according to the procedure described by Bertozzi et al. 
(Saxon and Bertozzi 2000). Treatment of (3) with N-Hydroxysuccinimide forms the corresponding NHS ester (4). Coupling 
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of (4) with an amino carboxyiic acid moiety produces compound (5) that has a long linker (n = 1 to 10) for optimized 
coupling with DNA on the surface. Treatment of (5) with N-Hydroxysuccinimide generates the NHS ester (1 ) which is 
ready for coupling with the primary amine coated surface (Figure 3). 

[0071] The azido labeled DNA (2) is synthesized according to the scheme shown in Figure 5. Treatment of ethyl ester 
5 of 5-bromovaleric acid with sodium azide and then hydrolysis produces 5-azidovaleric acid (Khoukhi et al., 1 987), which 
is subsequently converted to a NHS ester for coupling with an amino linker modified oligonucleotide primer. Using the 
azido-labeled primer to perform polymerase chain reaction (PCR) reaction generates azido-labeled DNA template (2) 
for coupling with the triarylphosphine-modified surface (Figure 3). 

[0072] The self-primed DNA template moiety on the sequencing chip is constructed as shown in Figure 6 (A & 8) 
10 using enzymatic ligation. A 5-phosphorylated, 3-OH capped loop oligonucleotide primer (B) is synthesized by a solid 
phase DNA synthesizer. Primer (B) is synthesized using a modified C phosphoramidite whose 3'-OH is capped with 
either a MOM (-CH 2 OCH 3 ) group or an allyl (-CH 2 CH=CH 2 ) group (designated by "R" in Figure 6) at the 3'-end of the 
oligonucleotide to prevent the self ligation of the primer in the ligation reaction. Thus, the looped primer can only ligate 
to the 3'-end of the DNA templates that are immobilized on the sequencing chip using T4 RNA ligase (Zhang et al. 1 996) 
15 to form the self-primed DNA template moiety (A). The looped primer (B) is designed to contain a very stable loop (Antao 
et al. 1991) and a stem containing the sequence of M13 reverse DNA sequencing primer for efficient priming in the 
polymerase reaction once the primer is ligated to the immobilized DNA on the sequencing chip and the 3'-OH cap group 
is chemically cleaved off (Ireland et al. 1986; Kamal et al. 1999). 

20 3. Sequencing by Synthesis Evaluation Using Nucleotide 

Analogues 3 '-HCf A -Dyelr' SMO^'Dy** a'-HO^-DyeS' 3'-HO" T -Dye4 

[0073] A scheme has been developed for evaluating the photocleavage efficiency using different dyes and testing the 
sequencing by synthesis approach. Four nucleotide analogues 3 .. H0 -A- Dye1 , 3 .. HO -C- Dye2 , 3 .. H o- G -Dye3' 3- HO* T -Dye4 
25 each labeled with a unique fluorescent dye through a photocleavable linker are synthesized and used in the sequencing 
by synthesis approach. Examples of dyes include, but are not limited to: Dye1 = FAM, 5-carboxyfluorescein; Dye2 = 
R6G, 6-carboxyrhodamine-6G; Dye3 = TAM, N,N,N\NMetramethyl-6-carboxyrhodamine; and Dye4 = ROX, 6-carboxy- 
X-rhodamine. The structures of the 4 nucleotide analogues are shown in Figure 7 (R = h). 

[0074] The photocleavable 2-nitrobenzyl moiety has been used to link biotin to DNA and protein for efficient removal 
30 by UV light (- 350 nm) (Olejnik et al. 1995, 1999). In the approach disclosed herein the 2-nitrobenzyl group is used to 
bridge the fluorescent dye and nucleotide together to form the dye labeled nucleotides as shown in Figure 7. 
[0075] As a representative example, the synthesis of 3 '_ HO -G- Dye3 (Dye3 = Tarn) is shown in Figure 8. 7-deaza- 
alkynylarnino-dGTP is prepared using well-established procedures (Prober et al. 1987; Lee et al. 1992 and Hobbs et al. 
1991). Linker-Tarn is synthesized by coupling the Photocleavable Linker (Rollaf 1982) with NHS-Tam. 7-deaza- 
35 alkynylamino-dGTP is then coupled with the Linker-Tarn to produce 3 _ H0 -G-TAM. The nucleotide analogues with a free 
3*-OH (i.e., R = H) are good substrates for the polymerase. An immobilized DNA template is synthesized (Figure 9) that 
contains a portion of nucleotide sequence ACGTACGACGT (SEQ ID NO: 1 ) that has no repeated sequences after the 
priming site. 3 '-HO~A-Dyei and DNA polymerase are added to the self-primed DNA moiety and it is incorporated to the 3' 
site of the DNA. Then the steps in Figure 2A are followed (the chemical cleavage step is not required here because the 
40 3'-OH is free) to detect the fluorescent signal from Dye-1 at 520 nm. Next, 3 '_HO' c *Dye2 is added to image the fluorescent 
signal from Dye-2 at 550 nm. Next, 3 --HO* G "Dye3 ' s added to image the fluorescent signal from Dye-3 at 580 nm, and 
finally 3 '. H o _T "Dye4 js added to image the fluorescent signal from Dye-4 at 610 nm. 

Results on photochemical cleavage efficiency 

45 

[0076] The expected photolysis products of DNA containing a photocleavable fluorescent dye at the 3' end of the DNA 
are shown in Figure 10. The 2-nitrobenzyl moiety has been successfully employed in a wide range of studies as a 
photocleavable-protecting group (Pillai 1980). The efficiency of the photocleavage step depends on several factors 
including the efficiency of light absorption by the 2-nitrobenzyl moiety, the efficiency of the primary photochemical step, 

50 and the efficiency of the secondary thermal processes which lead to the final cleavage process (Turro 1991 ). Burgess 
et al. (1997) have reported the successful photocleavage of a fluorescent dye attached through a 2-nitrobenzyl linker 
on a nucleotide moiety, which shows that the fluorescent dye is not quenching the photocleavage process. A photoliable 
protecting group based on the 2-nitrobenzyl chromophore has also been developed for biological labeling applications 
that involve photocleavage (Olejnik et al. 1999). The protocol disclosed herein is used to optimize the photocleavage 

55 process shown in Figure 10. The absorption spectra of 2-nitro benzyl compounds are examined and compared quan- 
titatively to the absorption spectra of the fluorescent dyes. Since there will be a one-to-one relationship between the 
number of 2-nitrobenzyl moieties and the dye molecules, the ratio of extinction coefficients of these two species will 
reflect the competition for light absorption at specific wavelengths. From this information, the wavelengths at which the 
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2-nitrobenzyl moieties absorbed most competitively can be determined, similar to the approach reported by Olejnik et 
al. (1995). 

[0077] A photolysis setup can be used which allows a high throughput of monochromatic light from a 1000 watt high 
pressure xenon lamp (LX1000UV, ILC) in conjunction with a monochromator (Kratos, Schoeffel Instruments). This 

5 instrument allows the evaluation of the photocleavage of model systems as a function of the intensity and excitation 
wavelength of the absorbed light. Standard analytical analysis is used to determine the extent of photocleavage. From 
this information, the efficiency of the photocleavage as a function of wavelength can be determined. The wavelength at 
which photocleavage occurs most efficiently can be selected as for use in the sequencing system. 
[0078] Photocleavage results have been obtained using a model system as shown in Figure 11. Coupling of PC-LC- 

10 Biotin-NHS ester (Pierce, Rockford IL) with 5-(aminoacetamido)-fluorescein (5-aminoFAM) (Molecular Probes, Eugene 
OR) in dimethylsulfonyl oxide (DMSO)/NaHC0 3 (pH=8.2) overnight at room temperature produces PC-LC-Biotin-FAM 
which is composed of a biotin at one end, a photocleavable 2-nitrobenzyl group in the middle, and a dye tag (FAM) at 
the other end. This photocleavable moiety closely mimics the designed photocleavable nucleotide analogues shown in 
Figure 10. Thus the successful photolysis of the PC-LC -Biotin -FAM moiety provides proof of the principle of high 

15 efficiency photolysis as used in the DNA sequencing system. For photolysis study, PC-LC -Biotin -FAM is first immobilized 
on a microscope glass slide coated with streptavidin (XENOPORE, Hawthorne NJ). After washing off the non-immobilized 
PC-LC-Biotin-FAM, the fluorescence emission spectrum of the immobilized PC-LC-Biotin-FAM was taken as shown 
in Figure 12 (Spectrum a). The strong fluorescence emission indicates that PC-LC-Biotin-FAM is successfully immo- 
bilized to the streptavidin coated slide surface. The photocleavability of the 2-nitrobenzyl linker by irradiation at 350 nm 

20 was then tested. After 1 0 minutes of photolysis (X^ = 350 nm; -0.5 mW/cm 2 ) and before any washing, the fluorescence 
emission spectrum of the same spot on the slide was taken that showed no decrease in intensity (Figure 12, Spectrum 
b), indicating that the dye (FAM) was not bleached during the photolysis process at 350 nm. After washing the glass 
slide with HPLC water following photolysis, the fluorescence emission spectrum of the same spot on the slide showed 
significant intensity decrease (Figure 12, Spectrum c) which indicates that most of the fluorescence dye (FAM) was 

25 cleaved from the immobilized biotin moiety and was removed by the washing procedure. This experiment shows that 
high efficiency cleavage of the fluorescent dye can be obtained using the 2-nitrobenzyl photocleavable linker. 

4. Sequencing by Synthesis Evaluation Using Nucleotide 

Analogues 3'.RCfA- Dy el , 3 -RO _C "Dye2» 3'-RO" G "Dye3> 3'-RO _T ~Dye4 

30 

[0079] Once the steps and conditions in Section 3 are optimized, the synthesis of nucleotide 
analogues 3--Ro- A_ Dyei' 3'-RO~C~Dye2' 3'-RO~G"Dye3» 3 -RO" T 'Dye4 Can & e pursued for further study of the system. Here the 
3'-OH is capped in all four nucleotide analogues, which then can be mixed together with DNA polymerase and used to 
evaluate the sequencing system using the scheme in Figure 9. The MOM (-CH 2 OCH 3 ) or allyl (-CH 2 CH=CH 2 ) group is 
35 used to cap the 3'-OH group using well-established synthetic procedures (Figure 13) (Fuji et al. 1975, Metzker et al. 
1994). These groups can be removed chemically with high yield as shown in Figure 14 (Ireland, et al. 1986; Kamal et 
al. 1999). The chemical cleavage of the MOM and ally! groups is fairly mild and specific, so as not to degrade the DNA 
template moiety. For example, the cleavage of the allyl group takes 3 minutes with more than 93% yield (Kamal et al. 
1999), while the MOM group is reported to be cleaved with close to 100% yield (Ireland, et al. 1986). 

40 

5. Using Energy Transfer Coupled Dyes To Optimize The Sequencing By Synthesis System 

[0080] The spectral property of the fluorescent tags can be optimized by using energy transfer (ET) coupled dyes. 
[0081] The ET primer and ET dideoxynucleotides have been shown to be a superior set of reagents for 4-color DNA 

45 sequencing that allows the use of one laser to excite multiple sets of fluorescent tags ( Ju et al. 1 995). It has been shown 
that DNA polymerase (Thermo Sequenase and Taq FS) can efficiently incorporate the ETdye labeled dideoxynucleotides 
(Rosenblum et al. 1 997). These ET dye-labeled sequencing reagents are now widely used in large scale DNA sequencing 
projects, such as the human genome project. A library of ET dye labeled nucleotide analogues can be synthesized as 
shown in Figure 1 5 for optimization of the DNA sequencing system. The ET dye set (FAM-CI 2 FAM, FAM-CI 2 R6G, FAM- 

50 CI 2 TAM, FAM-CI 2 ROX) using FAM as a donor and dichloro(FAM, R6G, TAM, ROX) as acceptors has been reported in 
the literature (Lee et al. 1997) and constitutes a set of commercially available DNA sequencing reagents. These ET dye 
sets have been proven to produce enhanced fluorescence intensity, and the nucleotides labeled with these ET dyes at 
the 5-position of T and C and the 7-position of G and A are excellent substrates of DNA polymerase. Alternatively, an 
ET dye set can be constructed using cyanine (Cy2) as a donor and CI 2 FAM, CI 2 R6G, CI 2 TAM, or CI 2 ROX as energy 

55 acceptors. Since Cy2 possesses higher molar absorbance compared with the rhodamine and fluorescein derivatives, 
an ET system using Cy2 as a donor produces much stronger fluorescence signals than the system using FAM as a 
donor (Hung et al. 1996). Figure 16 shows a synthetic scheme for an ET dye labeled nucleotide analogue with Cy2 as 
a donor and CI 2 FAM as an acceptor using similar coupling chemistry as for the synthesis of an energy transfer system 
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using FAM as a donor (Lee et al. 1997). Coupling of CI 2 FAM (I) with spacer 4-aminomethylbenzoic acid (II) produces 
III, which is then converted to NHS ester IV. Coupling of IV with amino-Cy2, and then converting the resulting compound 
to a NHS ester produces V, which subsequently couples with amino-photolinker nucleotide VI yields the ET dye labeled 
nucleotide VII. 

5 

6. Sequencing by synthesis evaluation using nucleotide 

analogues 3'-HO- A Tag1,3'-HO- C -Tag2' a'-HO^-TagS' 3'-HO' T Tag4 

[0082] The precursors of four examples of mass tags are shown in Figure 17. The precursors are: (a) acetophenone; 

10 (b) 3-fluoroacetophenone; (c) 3,4-difluoroacetophenone; and (d) 3,4-dimethoxyacetophenone. Upon nitration and re- 
duction, four photoactive tags are produced from the four precursors and used to code for the identity of each of the four 
nucleotides (A, C, G, T). Clean APCI mass spectra are obtained for the four mass tag precursors (a, b f c, d) as shown 
in Figure 18. The peak with m/z of 121 is a, 139 is b, 157 is c, and 181 is d. This result shows that these four mass 
tags are extremely stable and produce very high resolution data in an APCI mass spectrometer with no cross talk between 

15 the mass tags. In the examples shown below, each of the unique m/z from each mass tag translates to the identity of 
the nucleotide [Tag-1 (m/z,150) = A; Tag-2 (m/z ( 168) = C; Tag-3 (m/z,186) = G; Tag-4 (m/z,210) = TJ. 
[0083] Different combinations of mass tags and nucleotides can be used, as indicated by the general 
scheme: 3( . H o- A -Tagi» 3'-HO- c Tag2. 3'-HO-G-Ta 9 3. 3 ^HO- T ^ag4 wh ereTag1,Tag2,Tag3, and Tag4 are four different unique 
cleavable mass tags. Four specific examples of nucleotide analogues are shown in Figure 19. In Figure 19, "R" is H 

20 when the 3'-OH group is not capped. As discussed above, the photo cleavable 2-nitro benzyl moiety has been used to 
link biotin to DNA and protein for efficient removal by UV light (- 350 nm) irradiation (Olejnik et al. 1995, 1999). Four 
different 2-nitro benzyl groups with different molecular weights as mass tags are used to form the mass tag labeled 
nucleotides as shown in Figure 19: 2-nitro-a-methyl-benzyl (Tag-1) codes for A; 2-nitro-ot-methyl-3-fluorobenzyl (Tag- 
2) codes for C; 2-nitro-a-methyl-3,4-difluorobenzyl (Tag-3) codes for G; 2-nitro-a-methyl-3,4-dimethoxybenzyi (Tag-4) 

25 codes for T. 

[0084] As a representative example, the synthesis of the NHS ester of one mass tag (Tag-3) is shown in Figure 20. 
A similar scheme is used to create the other mass tags. The synthesis of 3'.HO" G *Tag3 js shown in Figure 21 using well- 
established procedures (Prober et al. 1987; Lee et al. 1992 and Hobbs et al. 1991). 7-propargylamino- dGTP is first 
prepared by reacting 7-l-dGTP with N-trifluoroacetylpropargyl amine, which is then coupled with the NHS-Tag-3 to 

30 produce 3'.ho" g " Tag3- Tne nucleotide analogues with a free 3-OH are good substrates for the polymerase. 

[0085] The sequencing by synthesis approach can be tested using mass tags using a scheme similar to that show for 
dyes in Figure 9. A DNA template containing a portion of nucleotide sequence that has no repeated sequences after 
the priming site, is synthesized and immobilized to a glass channel. 3'.HO" A Tagi ancl & NA polymerase are added to the 
self-primed DNA moiety to allow the incorporation of the nucleotide into the 3' site of the DNA. Then the steps in Figure 

35 2B are followed (the chemical cleavage is not required here because the 3-OH is free) to detect the mass tag from Tag- 
1 (m/z = 150). Next, 3'-ho~C" Tag2 ' s added and the resulting mass spectra is measured after cleaving Tag-2 (m/z = 168). 
Next, 3 '.HO- G_ Tag3 and 3'-ho- t_ Tag4 are added in turn and the mass spectra of the cleavage products Tag-3 (m/z =186) 
and Tag-4 (m/z = 210) are measured. Examples of expected photocleavage products are shown in Figure 22. The 
photocleavage mechanism is as described above for the case where the unique labels are dyes. Light absorption (300 

40 - 360 nm) by the aromatic 2-nitro benzyl moiety causes reduction of the 2-nitro group to a nitroso group and an oxygen 
insertion into the carbon-hydrogen bond located in the 2-position followed by cleavage and decarboxylation (Pillai 1980). 
[0086] The synthesis of nucleotide analogues 3 '. R0 - A -Tagi » 3'-Rcr c -Tag2' 3'-RO _G -Tag3' 3'-R 0 - T -Tag4 can be pursued for 
further study of the system a discussed above for the case where the unique labels are dyes. Here the 3-OH is capped 
in all four nucleotide analogues, which then can be mixed together with DNA polymerase and used to evaluate the 

45 sequencing system using a scheme similar to that in Figure 9. The MOM (-CH 2 OCH 3 ) or allyl (-CH 2 CH=CH 2 ) group is 
used to cap the 3-OH group using well-established synthetic procedures (Figure 13) (Fuji et al. 1975, Metzker et al. 
1994). These groups can be removed chemically with high yield as shown in Figure 14 (Ireland, et al. 1986; Kamal et 
al. 1 999). The chemical cleavage of the MOM and allyl groups is fairly mild and specific, so as not to degrade the DNA 
template moiety. 

50 

8. Validate the Complete Sequencing by Synthesis System By Sequencing P53 Genes 

[0087] The tumor suppressor gene p53 can be used as a model system to validate the DNA sequencing system. The 
p53 gene is one of the most frequently mutated genes in human cancer (O'Connor et al. 1997). First, a base pair DNA 
55 template (shown below) is synthesized containing an azido group at the 5' end and a portion of the sequences from 
exon 7 and exon 8 of the p53 gene: 

5-N3-TTCCTGCATGGGCGGCATGAACCCGAGGCCCATCCTCACCATCATCAC 
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ACTGGAAGACTCCAGTGGTAATCTACTGGGACGGAACAGCTTTGAGGTGCATT -3' (SEQ ID NO: 2). 

[0088] This template is chosen to explore the use of the sequencing system for the detection of clustered hot spot 
single base mutations. The potentially mutated bases are underlined (A, G, C and T) in the synthetic template. The 
synthetic template is immobilized on a sequencing chip or glass channels, then the loop primer is ligated to the immobilized 
template as described in Figure 6, and then the steps in Figure 2 are followed for sequencing evaluation. DNA templates 
generated by PCR can be used to further validate the DNA sequencing system. The sequencing templates can be 
generated by PCR using flanking primers (one of the pair is labeled with an azido group at the 5' end) in the intron region 
located at each p53 exon boundary from a pool of genomic DNA (Boehringer, Indianapolis, IN) as described by Fu et 
al. (1998) and then immobilized on the DNA chip for sequencing evaluation. 
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SEQUENCE LISTING 
[0090] 

5 <1 1 0> The Trustees Of Columbia University In The City Of 

<120> Massive Parallel Method For Decoding DNA And RNA 
<130> 0575/62239-B-PCT/JPW 

10 

<140> Not Yet Known 
<141> 2001-10-05 

<150> 09/684,670 
15 <1 51 > 2000-10-06 

<160>2 

<170> PatentlnVer. 2.1 

20 

<210> 1 
<211>11 
<212> DNA 

<213> Artificial Sequence 

25 

<220> 

<223> Description of Artificial Sequence: template 
<400> 1 

30 acgtacgacg t 1 1 

<210>2 
<211> 101 
<212> DNA 
35 <21 3> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: template 
40 <400> 2 

ttcctgcatg ggcggcatga acccgaggcc catcctcacc atcatcacac tggaagactc 60 
cagtggtaat ctactgggac ggaacagctt tgaggtgcat t 101 

45 



Claims 

50 1 . A method for sequencing a nucleic acid by sequentially determining the identity of nucleotide analogues after the 
nucleotide analogues are incorporated into a growing strand of DNA in a polymerase reaction, which comprises the 
following steps: 

(i) attaching a 5' end of the nucleic acid to a solid surface; 
55 (ii) attaching a primer to the nucleic acid attached to the solid surface; 

(iii) adding a polymerase and one or more different nucleotide analogues to the nucleic acid to thereby Incorporate 
a nucleotide analogue into the growing strand of DNA, wherein the incorporated nucleotide analogue terminates 
the polymerase reaction and wherein each different nucleotide analogue comprises (a) a base selected from 
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the group consisting of adenine, guanine, cytosine, thymine, and uracil, and their analogues; (b) a unique label 
attached through a cleavable linker to the base or to an analogue of the base; (c) a deoxyribose; and (d) a 
cleavable chemical group to cap an -OH group at a 3-position of the deoxyribose, wherein the cleavable chemical 
group is -CH 2 OCH 3 or -CH 2 CH=CH 2 ; 
s (iv) washing the solid surface to remove unincorporated nucleotide analogues; 

(v) determining the identity of the unique label attached to the nucleotide analogue that has been incorporated 
into the growing strand of DNA, so as to thereby identify the incorporated nucleotide analogue; 

(vi) adding one or more chemical compounds to permanently cap any unreacted -OH group on the primer 
attached to the nucleic acid or on a primer extension strand formed by adding one or more nucleotides or 

10 nucleotide analogues to the primer; 

(vii) cleaving the cleavable linker between the nucleotide analogue that was incorporated into the growing strand 
of DNA and the unique label; 

(viii) cleaving the cleavable chemical group capping the -OH group at the 3'-position of the deoxyribose to uncap 
the -OH group, and washing the solid surface to remove cleaved compounds; and 

15 (ix) repeating steps (iii) through (viii) so as to determine, for each repetition, the identity of the newly incorporated 

nucleotide analogue into the growing strand of DNA; 

wherein if the unique label is a dye, the order in which steps (v) through (vii) are performed is: (v), (vi) and (vii); and 
if the unique label is a mass tag, the order in which steps (v) through (vii) are performed is: (vi), (vii) and (v). 

20 

2. The method of claim 1 , wherein the solid surface is glass, silicon, or gold. 

3. The method of claim 1 , wherein the solid surface is a magnetic bead, a chip, a channel in a chip, or a porous channel 
in a chip. 

25 

4. The method of claim 1 , wherein the step of attaching the nucleic acid to the solid surface comprises: 

(i) coating the solid surface with a phosphine moiety, 

(ii) attaching an azido group to the 5* end of the nucleic acid, and 

30 (nj) immobilizing the 5' end of the nucleic acid to the solid surface through interaction between the phosphine 

moiety on the solid surface and the azido group on the 5' end of the nucleic acid. 

5. The method of claim 4, wherein the step of coating the solid surface with the phosphine moiety comprises: 

35 (i) coating the surface with a primary amine, and 

(ii) covalently coupling a N-hydroxysuccinimidyl ester of triarylphosphine with the primary amine. 

6. The method of claim 1 , wherein the nucleic acid that is attached to the solid surface is a single-stranded DNA. 

40 7. The method of claim 1 , wherein the nucleic acid that is attached to the solid surface in step (i) is a double-stranded 
DNA, wherein only one strand is directly attached to the solid surface, and wherein the strand that is not directly 
attached to the solid surface is removed by denaturing before proceeding to step (ii). 

8. The method of claim 1 , wherein the nucleic acid that is attached to the solid surface is a RNA, and the polymerase 
45 in step (iii) is reverse transcriptase. 

9. The method of claim 1 , wherein the primer is attached to a 3* end of the nucleic acid in step (ii) and wherein the 
attached primer comprises a stable loop and an -OH group at a 3'-position of a deoxyribose capable of self-priming 
in the polymerase reaction. 

so 

10. The method of claim 1 , wherein the step of attaching the primer to the nucleic acid comprises hybridizing the primer 
to the nucleic acid or ligating the primer to the nucleic acid. 

11. The method of claim 1, wherein one or more of four different nucleotide analogues is added in step (iii), wherein 
55 each different nucleotide analogue comprises a different base selected from the group consisting of thymine or 

uracil or an analogue of thymine or uracil, adenine or an analogue of adenine, cytosine or an analogue of cytosine, 
and guanine or an analogue of guanine, and wherein each of the four different nucleotide analogues comprises a 
unique label. 
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12. The method of claim 1, wherein the cleavable chemical group that caps the -OH group at the 3' -position of the 
deoxyribose in the nucleotide analogue is - CH 2 CH=CH 2 . 

1 3. The method of claim 1 , wherein the unique label that is attached to the nucleotide analogue is a fluorescent moiety 
5 or a fluorescent semiconductor crystal. 

1 4. The method of claim 13, wherein the fluorescent moiety is selected from the group consisting of 5-carboxyfluorescein, 
6-carboxyrhodamine-6G, N.N.N'.N'-tetramethyl-e-carboxyrhodamine, and 6-carboxy-x-rhodamine. 

io 1 5. The method of claim 1 , wherein the unique label that is attached to the nucleotide analogue is a fluorescence energy 
transfer tag which comprises an energy transfer donor and an energy transfer acceptor. 

16. The method of claim 15, wherein the energy transfer donor is 5-carboxyfluorescein or cyanine, and wherein the 
energy transfer acceptor is selected from the group consisting of dichlorocarboxyfluorescein, dichloro-6-carbox- 

15 yrhodamine-6G, dichloro-N.N.N'.N'-tetramethyl-e-carboxyrhodamine, and dichloro-6-carboxy-X-rhodamine. 

17. The method of claim 1 , wherein the unique label that is attached to the nucleotide analogue is a mass tag that can 
be detected and differentiated by a mass spectrometer. 

20 18. The method of claim 17, wherein the mass tag is selected from the group consisting of a 2-nitro-oc-methyl-benzyl 
group, a 2-nitro-cc-methyl-3-fluorobenzyl group, a 2-nitro-a-methyl-3,4-difluorobenzyl group, and a 2-nitro-a-methyl- 
3,4-dimethoxybenzyl group. 

19. The method of claim 1, wherein the unique label is attached through a cleavable linker to a 5-position of cytosine 
25 or thymine or to a 7-position of deaza-adenine or deaza-guanine. 

20. The method of claim 1 , wherein the cleavable linker between the unique label and the nucleotide analogue is cleaved 
by a means selected from the group consisting of one or more of a physical means, a chemical means, a physical 
chemical means, heat, and light. 

30 

21. The method of claim 20, wherein the cleavable linker is a photocleavable linker which comprises a 2-nitrobenzyl 
moiety. 

22. The method of claim 1 , wherein the cleavable chemical group used to cap the -OH group at the 3'-position of the 
35 deoxyribose is cleaved by a means selected from the group consisting of one or more of a physical means, a 

chemical means, a physical chemical means, heat, and light. 

23. The method of claim 1 , wherein the chemical compounds added in step (vi) to permanently cap any unreacted -OH 
group on the primer attached to the nucleic acid or on the primer extension strand are a polymerase and one or 

40 more different dideoxynucleotides or analogues of dideoxynucleotides. 

24. The method of claim 23, wherein the different dideoxynucleotides are selected from the group consisting of 2\3- 
dideoxyadenosine ^-triphosphate, 2\3'-dideoxyguanosine 5-triphosphate, 2\3'-dideoxycytidine 5'-triphosphate, 2\ 
3'-dideoxythymidine 5-triphosphate, 2\3'-dideoxyuridine 5'-triphosphase, and their analogues. 

45 

25. The method of claim 1 , wherein a polymerase and one or more of four different dideoxynucleotides are added in 
step (vi), and wherein each different dideoxynucleotide is selected from the group consisting of 2'3'-dideoxyade- 
nosine 5'-triphosphate or an analogue of 2',3-dideoxyadenosine 5-triphosphate; 2',3-dideoxyguanosine 5-triphos- 
phate or an analogue of 2',3'-dideoxyguanosine 5-triphosphate; 2\3'-dideoxycytidine 5'-triphosphate or an analogue 

50 of 2',3'-dideoxycytidine5'-triphosphate; and 2\3'-dideoxythymidine 5-triphosphate or 2',3'-dideoxyuridine5'-triphos- 

phase or an analogue of 2\3'-dideoxythymidine 5'-triphosphate or an analogue of 2',3'-dideoxyuridine 5'-triphos- 
phase. 

26. The method of claim 1 7, wherein the mass tag is detected using a parallel mass spectrometry system which comprises 
55 a plurality of atmospheric pressure chemical ionization mass spectrometers for parallel analysis of a plurality of 

samples comprising mass tags. 

27. A method of simultaneously sequencing a plurality of different nucleic acids, which comprises simultaneously applying 
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the method of claim 1 to the plurality of different nucleic acids. 

28. Use of the method of claim 1 or 27 for detection of single nucleotide polymorphisms, genetic mutation analysis, 
serial analysis of gene expression, gene expression analysis, identification in forensics, genetic disease association 

5 studies, DNA sequencing, genomic sequencing, translational analysis, or transcriptional analysis. 

29. A method of attaching a nucleic acid to a solid surface which comprises: 

(i) coating the solid surface with a phosphine moiety, 
10 (ii) attaching an azido group to a 5' end of the nucleic acid, and 

(iii) immobilizing the 5' end of the nucleic acid to the solid surface through interaction between the phosphine 
moiety on the solid surface and the azido group on the 5' end of the nucleic acid. 



15 
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30. The method of claim 29, wherein the step of coating the solid surface with the phosphine moiety comprises: 

(i) coating the surface with a primary amine, and 

(ii) covalently coupling a N-hydroxysuccinimidyl ester of triarylphosphine with the primary amine. 

31. The method of claim 29, wherein the solid surface is glass, silicon, or gold. 

32. The method of claim 29, wherein the solid surface is a magnetic bead, a chip, a channel in a chip, or a porous 
channel in a chip. 

33. The method of claim 29, wherein the nucleic acid that is attached to the solid surface is a single-stranded DNA, a 
25 double-stranded DNA or a RNA. 

34. The method of claim 33, wherein the nucleic acid is a double-stranded DNA and only one strand is attached to the 
solid surface. 

30 35. The method of claim 34, wherein the strand of the double-stranded DNA that is not attached to the solid surface is 
removed by denaturing. 

36. Use of the method of claim 29 for attaching a nucleic acid to a solid surface for gene expression analysis, microarray 
based gene expression analysis, mutation detection, translational analysis, or transcriptional analysis of said nucleic 

35 acid. 

37. A nucleotide analogue which comprises: 

(a) a base selected from the group consisting of adenine or an analogue of adenine, cytosine or an analogue 
40 of cytosine, guanine or an analogue of guanine, thymine or an analogue of thymine, and uracil or an analogue 

of uracil; 

(b) a unique label attached through a cleavable linker to the base or to an analogue of the base; 

(c) a deoxyribose; and 

(d) a cleavable chemical group to cap an -OH group at a 3'-position of the deoxyribose, wherein the cleavable 
45 chemical group is -CH 2 OCH 3 or -CH 2 CH=CH 2 . 

38. The nucleotide analogue of claim 37, wherein the cleavable chemical group that caps the -OH group at the 3'- 
position of the deoxyribose in the nucleotide analogue is -CH 2 CH=CH 2 . 

so 39. The nucleotide analogue of claim 37, wherein the unique label is a fluorescent moiety or a fluorescent semiconductor 
crystal. 

40. The nucleotide analogue of claim 39, wherein the fluorescent moiety is selected from the group consisting of 5- 
carboxyfluorescein, 6-carboxyrhodamine-6G, N,N,N\NMetramethyl-6-carboxyrhodamine, and 6-carboxy-X-rhod- 

55 amine. 

41 . The nucleotide analogue of claim 37, wherein the unique label is a fluorescence energy transfer tag which comprises 
an energy transfer donor and an energy transfer acceptor. 
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42. The nucleotide analogue of claim 41, wherein the energy transfer donor is 5-carboxyfluorescein or cyanine, and 
wherein the energy transfer acceptor is selected from the group consisting of dichlorocarboxyfluorescein, dichloro- 
6-carboxyrhodamine-6G t dichloro-N.N.N'.N'-tetramethyl-e-carboxyrhodamine, and dichloro-6-carboxy-X-rhodam- 
ine. 

43. The nucleotide analogue of claim 37, wherein the unique label is a mass tag that can be detected and differentiated 
by a mass spectrometer. 



44. The nucleotide analogue of claim 43, wherein the mass tag is selected from the group consisting of a 2-nitro-a- 
methyl-benzyl group, a 2-nitro-a-methyl-3-fluorobenzyl group, a 2-nitro-a-methyl-3,4-difluorobenzyl group, and a 2- 
nitro-a-methyl-3,4-dimethoxybenzyl group. 



45. The nucleotide analogue of claim 37, wherein the unique label is attached through a cleavable linker to a 5-position 
of cytosine or thymine or to a 7-position of deaza-adenine or deaza-guanine. 

15 

46. The nucleotide analogue of claim 37, wherein the linker between the unique label and the nucleotide analogue is 
cleavable by a means selected from the group consisting of one or more of a physical means, a chemical means, 
a physical chemical means, heat, and light. 

20 47. The nucleotide analogue of claim 46, wherein the cleavable linker is a photocleavable linker which comprises a 2- 
nitrobenzyl moiety. 

48. The nucleotide analogue of claim 37, wherein the cleavable chemical group used to cap the -OH group at the 3- 
position of the deoxyribose is cleavable by a means selected from the group consisting of one or more of a physical 

25 means, a chemical means, a physical chemical means, heat, and light. 

49. The nucleotide analogue of claim 37, wherein the nucleotide analogue is selected from the group consisting of: 
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O O O H,« 

6- 6- 6 



i 



and 



o o o 

6- 6- 6- krf^> 

wherein Tag.,, Tag 2 , Tag 3 , and Tag 4 are four different mass tag labels; and 
wherein R is -CH 2 OCH 3 or -CH 2 CH=CH 2 . 

52. The nucleotide analogue of claim 51, wherein the nucleotide analogue is selected from the group consisting of: 

uu H 



O 0 o 

6- 6- A- 



o- <r o- 
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w 

and 

15 



O O 0 KtN „ , 

6- 6- * Ir^jl 



0 o o o* 

6- a- & ^ • 

25 wherein R is -CH 2 OCH 3 or -CH 2 CH=CH 2 . 

53. Use of the nucleotide analogue of claim 37 for detection of single nucleotide polymorphisms, genetic mutation 
analysis, serial analysis of gene expression, gene expression analysis, identification in forensics, genetic disease 
association studies, DNA sequencing, genomic sequencing, translational analysis, or transcriptional analysis. 

30 

Patentanspruche 

1 . Verfahren zum Sequenzieren einer Nukleinsaure durch sequentielles Bestimmen der Identitat von Nukleotidanaloga 
35 nachdem die Nukleotidanaloga in einen wachsenden Strang von DNA eingebaut sind, in einer Polymerasereaktion, 

die die folgenden Schritte umfasst: 

(i) Anheften eines 5' Endes der Nukleinsaure an eine feste Oberflache; 

(ii) Anheften eines Primers an die an die feste Oberflache angeheftete Nukleinsaure; 

to (jjj) Zugeben einer Polymerase und eines Oder mehrerer verschiedener Nukleotidanaloga zu der Nukleinsaure 

urn dadurch ein Nukleotidanalogon in den wachsenden Strang von DNA einzubauen, wobei das eingebaute 
Nukleotid ana logon die Polymerasereaktion beendet, und wobei jedes verschiedene Nukleotidanalogon umfasst 
(a) eine Base ausgewahlt aus der Gruppe bestehend aus Adenin, Guanin, Cytosin, Thymin, und Uracil, und 
ihrer Analoga; (b) eine eindeutige Markierung anheftet uber einen spaltbaren Linker an die Base Oder an ein 

45 Analogon der Base; (c) eine Desoxyribose; und (d) eine spaltbare chemische Gruppe urn eine -OH-Gruppe an 

einer 3'-Position der Desoxyribose zu schOtzen, wobei die spaltbare chemische Gruppe -CH 2 OCH 3 oder 
-CH 2 CH=CH 2 ist; 

(iv) Waschen der festen Oberflache, urn nicht-eingebaute Nukleotidanaloga zu entfernen; 

(v) Bestimmen der Identitat der eindeutigen Markierung, angehangt an das Nukleotidanalogon, das in den 
50 wachsenden Strang von DNA eingebaut worden ist, urn dadurch das eingebaute Nukleotidanalogon zu iden- 

tifizieren; 

(vi) Zugeben einer oder mehrere chemischerVerbindungen, urn dauerhaft jede nicht-reagierte -OH-Gruppe auf 
dem Primer, angehangt an die Nukleinsaure oder auf einem Primer-Extension-Strang, gebildet durch Zugabe 
eines oder mehrerer Nukleotide oder Nukleotidanaloga zu dem Primer zu schtitzen; 

5 5 (vii) Spalten des spaltbaren Linkers zwischen dem Nukleotidanalogon, das in den wachsenden Strang von DNA 

eingebaut worden ist, und der eindeutigen Markierung; 

(viii) Spalten der spaltbaren chemischen Gruppe, die die -OH-Gruppe an der 3'-Position der Desoxyribose 
schQtzt, urn die -OH-Gruppe zu entschOtzen, und Waschen der festen Oberflache urn gespaltene Verbindungen 
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zu entfernen; und 

(ix) Wiederholen der Schritte (iii) bis (viii), urn fur jede Wiederholung, die Identitat des neu eingebauten Nukleo- 
tidanalogons in den wachsenden Strang von DNA zu bestimmen; 

5 wobei, wenn die eindeutige Markierung ein Farbstoff ist, die Reihenfolge, in weicher die Schritte (v) bis (vii) durch- 

gefilhrt werden: (v), (vi) und (vii) ist; und 

wenn die eindeutige Markierung ein Massen-Tag ist, die Reihenfolge, in weicher die Schritte (v) bis (vii) durchgefuhrt 
werden: (vi), (vii) und (v) ist. 

10 2. Verfahren nach Anspruch 1 , wobei die feste Oberflache Glas, Silikon Oder Gold ist. 

3. Verfahren nach Anspruch 1 , wobei die feste Oberflache ein Magnetkugelchen, ein Chip, ein Kanal in einem Chip 
Oder ein poroser Kanal in einem Chip ist. 

15 4. Verfahren nach Anspruch 1 , wobei der Schritt des Anheftens der Nukleinsaure an die feste Oberflache umfasst: 

(i) Beschichten der festen Oberflache mit einem Phosphinrest, 

(ii) Anheften einer Azidogruppe an das 5'-Ende der Nukleinsaure, und 

(iii) Immobilisieren des 5'-Endes der Nukleinsaure auf der festen Oberflache durch Wechselwirkung zwischen 
20 dem Phosphinrest auf der festen Oberflache und der Azidogruppe am S'-Ende der Nukleinsaure. 

5. Verfahren nach Anspruch 4, wobei der Schritt des Beschichtens der festen Oberflache mit dem Phosphinrest umfasst: 

(i) Beschichten der Oberflache mit einem primaren Amin, und 
25 (ii) Kovalentes Bind en eines N-Hydroxysuccinimidylesters von Triarylphosphin mit dem primaren Amin. 

6. Verfahren nach Anspruch 1 , wobei die Nukleinsaure, die an die feste Oberflache angeheftet wird, eine einzelstrangige 
DNA ist. 

30 7. Verfahren nach Anspruch 1 , wobei die Nukleinsaure, die in Schritt (i) an die feste Oberflache angeheftet wird, eine 
doppelstrangige DNA ist, wobei nur ein Strang direkt an die feste Oberflache angeheftet wird, und wobei der Strang, 
der nicht direkt an die feste Oberflache angeheftet wird, vordem Obergang zu Schritt (ii) durch Denaturieren entfernt 
wird. 

35 8. Verfahren nach Anspruch 1 , wobei die Nukleinsaure, die an die feste Oberflache angeheftet wird, eine RNA ist, und 
die Polymerase in Schritt (iii) reverse Transkriptase ist. 

9. Verfahren nach Anspruch 1 , wobei der Primer an ein 3'-Ende der Nukleinsaure in Schritt (ii) angeheftet wird, und 
wobei der angeheftete Primer eine stabile Schleife und eine -OH-Gruppe an einer 3'-Position einer Desoxyribose 

40 umfasst, fahig zum Selbstprimen in der Polymerasereaktion. 

10. Verfahren nach Anspruch 1, wobei der Schritt des Anheftens des Primers an die Nukleinsaure Hybridisieren des 
Primers an die Nukleinsaure Oder Ligieren des Primers an die Nukleinsaure umfasst. 

45 11. Verfahren nach Anspruch 1, wobei eines Oder mehrere von vier verschiedenen Nukleotidanaloga in Schritt (iii) 
zugegeben wird, wobei jedes verschiedene Nukleotidanalogon eine verschiedene Base umfasst, ausgewahlt aus 
der Gruppe bestehend aus Thymin oder Uracil oder einem Analogon von Thymin oder Uracil, Adenin oder einem 
Analogon von Adenin, Cytosin oder einem Analogon von Cytosin, und Guanin oder einem Analogon von Guanin, 
und wobei jedes der vier verschiedenen Nukleotidanaloga eine eindeutige Markierung umfasst. 

50 

12. Verfahren nach Anspruch 1, wobei die spaltbare chemische Gruppe, die die -OH-Gruppe an der 3'-Position der 
Desoxyribose in dem Nukleotidanalogon schutzt -CH 2 CH=CH 2 ist. 

13. Verfahren nach Anspruch 1, wobei die eindeutige Markierung, die an das Nukleotidanalogon angeheftet ist, ein 
55 fluoreszierender Rest oder ein fluoreszierender Halbleiterkristall ist. 

14. Verfahren nach Anspruch 13, wobei der fluoreszierende Rest ausgewahlt ist aus der Gruppe bestehend aus 5- 
Carboxyfluorescein, 6-Carboxyrhodamin-6G, N,N,N\NMetramethyl-6-carboxyrhodamin, und 6-carboxy-X-rhoda- 
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min. 

15. Verfahren nach Anspruch 1, wobei die eindeutige Markierung, die an das Nukleotidana logon angeheftet ist, ein 
Fluoreszenzenergieubertragungs-Tag ist, die einen Energieubertragungsdonor und einen EnergieObertragungsak- 
zeptor umfasst. 

1 6. Verfahren nach Anspruch 1 5, wobei der Energieubertragungsdonor 5-Carboxyfluorescein Oder Cyanin ist, und wobei 
der Energieubertragungsakzeptor ausgewahlt ist ausder Gruppe bestehend aus Dichlorcarboxyfluorescein, Dichlor- 
6-carboxyrhodamin-6G, Dichlor-N,N,N',N'-tetramethyl-6-carboxyrhodamin, und Dichlor-6-carboxy-X-rhodamin. 

17. Verfahren nach Anspruch 1, wobei die eindeutige Markierung, die an das Nukleotidanalogon angeheftet ist, ein 
Massen-Tag ist, die uber ein Massenspektrometer detektiert und unterschieden werden kann. 

18. Verfahren nach Anspruch 17, wobei das Massen-Tag ausgewahlt ist aus der Gruppe bestehend aus einer 2-nitro- 
a-methyl-benzyl-Gruppe, einer 2-nitro-a-methyl-3-fluorbenzyl-Gruppe, einer 2-nitro-a-methyl-3,4-difluorbenzyl- 
Gruppe, und einer 2-nitro-a-methyl-3,4-dimethoxybenzyl-Gruppe. 

19. Verfahren nach Anspruch 1, wobei die eindeutige Markierung Qber einen spaltbaren Linker an eine 5-Position von 
Cytosin oder Thymin oder an eine 7-Position von Deaza-adenin oder Deaza-guanin angeheftet ist. 

20. Verfahren nach Anspruch 1 , wobei der spaltbare Linker zwischen der eindeutigen Markierung und dem Nukleoti- 
danalogon gespaltet wird durch ein Mittel ausgewahlt aus der Gruppe bestehend aus einem oder mehreren physi- 
kalischen Mitteln, einem chemischen Mittel, einem physikalisch-chemischen Mittel, Hitze und Licht. 

21 . Verfahren nach Anspruch 20, wobei der spaltbare Linker ein photospaltbarer Linker ist, der einen 2-Nitrobenzylrest 
umfasst. 

22. Verfahren nach Anspruch 1, wobei die spaltbare chemische Gruppe, verwendet um die -OH-Gruppe an der 3'- 
Position der Desoxyribose zu schutzen, gespaltet wird durch ein Mittel ausgewShlt aus der Gruppe bestehend aus 
einem oder mehreren physikalischen Mitteln, einem chemischen Mittel, eines physikalisch-chemischen Mittel, Hitze 
und Licht. 

23. Verfahren nach Anspruch 1 , wobei die in Schritt (vi) zugegebenen chemischen Verbindungen zum permanenten 
Schutzen jeder nicht-reagierten -OH-Gruppe auf dem Primer, angeheftet an die Nukleinsaure oder auf dem Primer- 
Extensionstrang, eine Polymerase und ein oder mehrere verschiedene Didesoxynukleotide oder Analoga von Di- 
desoxynukleotiden sind. 

24. Verfahren nach Anspruch 23, wobei die verschiedenen Didesoxynukleotide ausgewahlt sind aus der Gruppe be- 
stehend aus 2\3 , -Didesoxyadenosin-5'-triphosphat, 2',3'-Didesoxyguanosin-5'-triphosphat, 2',3-Didesoxycytidin- 
5'-triphosphat, 2\3'-Didesoxythymidin-5'-triphosphat, 2 , ,3'-Didesoxyuridin-5*-triphosphat, und ihre Analoga. 

25. Verfahren nach Anspruch 1, wobei eine Polymerase und ein oder mehrere von vier verschiedenen Didesoxynu- 
kleotiden in Schritt (vi) zugegeben werden, und wobei jedes verschiedene Didesoxynukleotid ausgewahlt ist aus 
der Gruppe bestehend aus 2',3'-Didesoxyadenosin-5 , -triphosphat oder einem Analogon von 2\3'-Didesoxyadeno- 
sin-S'-triphosphat; 2',3'-Didesoxyguanosin-5'-triphosphat oder einem Analogon von 2\3 , -Didesoxyguanosin-5'-tri- 
phosphat; 2 , ,3'-Didesoxycytidin-5'-triphosphat oder einem Analogon von 2\3'-Didesoxycytidin-5 , -triphosphat; und 
2',3'-Didesoxythymidin-5 , ~triphosphat oder 2\3'-Didesoxyuridin-5'-triphosphat oder einem Analogon von 2\3'-Di- 
desoxythymidin-5'-triphosphat oder einem Analogon von 2\3 , -Didesoxyuridin-5'-triphosphat. 

26. Verfahren nach Anspruch 17, wobei das Massen-Tag detektiert wird unter Verwendung eines parallelen Massen- 
spektrometriesystems, das eine Vielzahl von atmospheric pressure chemical ionization-Massenspektrometem um- 
fasst zur parallelen Analyse einer Vielzahl von Proben, die Massen-Tags umfassen. 

27. Ein Verfahren zum gleichzeitigen Sequenzieren einer Vielzahl von verschiedenen Nukleinsauren, welches das 
gleichzeitige Anwenden des Verfahrens nach Anspruch 1 auf die Vielzahl von verschiedenen Nukleinsauren umfasst. 

28. Verwendung des Verfahrens nach Anspruch 1 oder 27 zum Nachweis von Einzelnukleotidpolymorphismen, gene- 
tische Mutationsanalyse, serielle Analyse von Genexpression, Genexpressionsanalyse, Identifikation in der Foren- 
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sik, Assoziationsstudien genetischer Krankheiten, DNA-Sequenzierung, genomische Sequenzierung, Translations- 
analyse oder Transkriptionsanalyse. 

Ein Verfahren zum Anheften einer Nukleinsaure an eine feste Oberflache, welches umfasst: 

(i) Beschichten der festen Oberflache mit elnem Phosphinrest, 

(ii) Anheften einer Azidogruppe an ein 5'-Ende der NukeinsSure, und 

(iii) Immobilisieren des 5-Endes der Nukleinsaure auf der festen Oberflache durch Wechselwirkung zwischen 
dem Phosphinrest auf der festen Oberflache und der Azidogruppe am 5-Ende der Nukleinsaure. 

Verfahren nach Anspruch 29, wobei der Schritt des Beschichtens der festen Oberflache mit dem Phosphinrest 
umfasst: 

(i) Beschichten der Oberflache mit einem primaren Amin, und 
w (ii) Kovalentes Binden eines N-Hydroxysuccinimidylesters von Triarylphosphin mit dem primaren Amin. 

31. Verfahren nach Anspruch 29, wobei die feste Oberflache Glas, Silikon oder Gold ist. 

32. Verfahren nach Anspruch 29, wobei die feste Oberflache ein MagnetkUgelchen, ein Chip, ein Kanal in einem Chip 
20 oder ein poroser Kanal in einem Chip ist. 

33. Verfahren nach Anspruch 29, wobei die Nukleinsaure, die an die feste Oberflache angeheftet wird, eine einzelst- 
rangige DNA, eine doppelstrangige DNA oder eine RNA ist. 

25 34. Verfahren nach Anspruch 33, wobei die Nukleinsaure eine doppelstrangige DNA ist und nur ein Strang an die feste 
Oberflache angeheftet ist. 

35. Verfahren nach Anspruch 34, wobei der Strang der doppelstrangigen DNA, der nicht an die feste Oberflache an- 
geheftet ist, durch Denaturieren entfemt wird. 

30 

36. Verwendung des Verfahrens nach Anspruch 29 zum Anheften einer Nukleinsaure an eine feste Oberflache fur 
Genexpressionsanalyse, Microarray-basierte Genexpressionsanalyse, Mutationsnachweis, Translationsanalyse, 
oder Transkriptionsanalyse dieser Nukleinsaure. 

35 37. Ein Nukleotidanalogon welches umfasst: 

(a) eine Base ausgewahlt aus der Gruppe bestehend aus Adenin oder einem Analogon von Adenin, Cytosin 
oder einem Analogon von Cytosin, Guanin oder einem Analogon von Guanin, Thymin oder einem Analogon 
von Thymin, und Uracil oder einem Analogon von Uracil; 
40 (b) eine eindeutige Markierung angeheftet uber einen spaltbaren Linker an die Base oder an ein Analogon der 

Base; 

(c) eine Desoxyribose; und 

(d) eine spaltbare chemische Gruppe, urn eine -OH-Gruppe an einer 3-Position der Desoxyribose zu schutzen, 
wobei die spaltbare chemische Gruppe -CH 2 OCH 3 oder -CH 2 CH=CH 2 ist. 

45 

38. Nukleotidanalogon nach Anspruch 37, wobei die spaltbare chemische Gruppe, welche die -OH-Gruppe an der 3'- 
Position der Desoxyribose in dem Nukleotidanalogon schutzt -CH 2 CH=CH 2 ist. 

39. Nukleotidanalogon nach Anspruch 37, wobei die eindeutige Markierung ein fluoreszierender Rest oder ein fluores- 
so zierender Halbleiterkristall ist. 

40. Nukleotidanalogon nach Anspruch 39, wobei der fluoreszierende Rest ausgewahlt ist aus der Gruppe bestehend 
aus 5-Carboxyfluorescein, 6-Carboxyrhodamin-6G, N.N.N'.N'-Tetramethyl-S-carboxyrhodamin, und 6-Carboxy-X- 
rhodamin. 

55 

41. Nukleotidanalogon nach Anspruch 37, wobei die eindeutige Markierung ein HuoreszenzenergieUbertragungs-Tag 
ist, die einen EnergieGbertragungsdonor und einen EnergieQbertragungsakzeptor umfasst. 
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42. Nukleotidanalogon nach Anspruch 41 , wobei der EnergieObertragungsdonor 5-Carboxyfluorescein Oder Cyanin ist, 
und wobei der EnergieQbertragungsakzeptor ausgewShlt ist aus der Gruppe bestehend aus Dichlorcarboxyfluore- 
scein, Dichlor-6-carboxyrhodamin-6G, Dichlor-N.N.N'.N'-tetramethyl-e-carboxyrhodamin, und Dichlor-6-carboxy-X- 
rhodamin. 

43. Nukleotidanalogon nach Anspruch 37, wobei die eindeutige Markierung ein Massen-Tag ist, die durch ein Massen- 
spektrometer detektiert und unterschieden werden kann. 

44. Nukleotidanalogon nach Anspruch 43, wobei das Massen-Tag ausgewahlt ist aus der Gruppe bestehend aus einer 
2-Nitro-a-methyl-benzyl-Gruppe, einer 2-Nitro-a-methyl-3-fluorbenzyl-Gruppe, einer 2-Nitro-a-methyl-3,4-difluor- 
benzyl-Gruppe, und einer 2-Nitro-a-methyl-3,4-dimethoxybenzyl-Gruppe. 

45. Nukleotidanalogon nach Anspruch 37, wobei die eindeutige Markierung uber einen spaltbaren Linker an eine 5- 
Position von Cytosin Oder Thymin oder an eine 7-Position von Deaza-adenin Oder Deaza-guanin angeheftet ist. 

46. Nukleotidanalogon nach Anspruch 37, wobei der Linker zwischen der eindeutigen Markierung und dem Nukleoti- 
danalogon spaltbar ist durch ein Mittel ausgewShlt aus der Gruppe bestehend aus einem oder mehreren physika- 
lischen Mitteln, einem chemischen Mittel, einem physikalisch-chemischen Mittel, Hitze und Licht. 

47. Nukleotidanalogon nach Anspruch 46, wobei der spaltbare Linker ein photospaltbarer Linker ist, der einen 2-Nitro- 
benzylrest umfasst. 

48. Nukleotidanalogon nach Anspruch 37, wobei die spaltbare chemische Gruppe, verwendet urn die -OH-Gruppe an 
der 3'-Position der Desoxyribose zu schutzen, spaltbar ist durch ein Mittel ausgewdhlt aus der Gruppe bestehend 
aus einem oder mehreren physikalischen Mitteln, einem chemischen Mittel, einem physikalisch-chemischen Mittel, 
Hitze und Licht. 

49. Nukleotidanalgon nach Anspruch 37, wobei das Nukleotidanalogon ausgwShlt ist aus der Gruppe bestehend aus: 




34 



EP 1 337 541 B1 




35 



EP 1 337 541 B1 



5 



10 




t 



15 

und 




9 



wobei R -CH 2 OCH 3 oder -CH 2 CH=CH 2 ist. 

35 

51 . Nukleotidanalogon nach Anspruch 37, wobei das Nukleotidanalogon ausgewShlt ist aus der Gruppe bestehend aus: 
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wobei Tag 1t Tag 2 , Tag 3 , und Tag 4 vier verschiedene Massen-Tag-Markierungen sind; und 
wobei R -CH 2 OCH 3 oder -CH 2 CH=CH 2 ist. 

50 52. Nukleotidanalogon nach Anspruch 51 , wobei das Nukleotidanalogon ausgewShlt ist aus der Gruppe bestehend aus: 
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wobei R -CH 2 OCH 3 oder -CH 2 CH=CH 2 ist. 
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53. Verwendung des Nukleotidanalogons nach Anspruch 37 zum Nachweis von Einzelnukleotid polymorph ismen, ge- 
netische Mutationsanalyse, serielle Analyse von Genexpression, Genexpressionsanalyse, Identifizierung in der 
Forensik, Assoziationsstudien genetischer Krankheiten, DNA-Sequenzierung, genomische Sequenzlerung, Trans- 
lationsanalyse oder Transkriptionsanalyse. 



Revendications 

1 . Proc6de de sequencage d'un acide nu clique par determination sequentielle de I'identite d'analogues nucieotidiques 
10 apres que les analogues nucieotidiques ont ete incorpores dans un brin en croissance d'ADN dans une reaction 

de polymerase, qui comprend les etapes suivantes : 

(i) attacher une extremity 5* de Tackle nuc!6ique & une surface solide ; 

(ii) attacher une amorce d I'acide nucieique attache a la surface solide ; 

15 (iii) ajouter une polymerase et un ou plusieurs analogues nucieotidiques differents a I'acide nucieique pour 

incorporer ainsi un analogue nucieotidique dans le brin en croissance d'ADN, dans lequel ('analogue nucieoti- 
dique incorpore interrompt la reaction de polymerase et dans lequel chaque analogue nucieotidique different 
comprend (a) une base choisie dans le groupe constitu6 par I'adenine, la guanine, la cytosine, la thymine et 
I'uracile, et leurs analogues ; (b) un marqueur unique attache par le biais d'un lieur clivable a la base ou a un 

20 analogue de la base ; (c) un desoxyribose ; et (d) un groupe chimique clivable pour coiffer un groupe -OH a 

une position 3' du desoxyribose, dans lequel le groupe chimique clivable est -CH 2 OCH 3 ou -CH 2 CH=CH 2 ; 

(iv) laver la surface solide pour retirer les analogues nucieotidiques non incorpores ; 

(v) determiner Tidentite du marqueur unique attache a I'analogue nucieotidique qui a ete incorpore dans le brin 
en croissance d'ADN, de manure a identifier ainsi les analogues nucieotidiques incorpores ; 

25 (vi) ajouter un ou plusieurs composes chimiques pour coiffer en permanence tout groupe -OH n'ayant pas r6agi 

sur Tamorce attachee a I'acide nucieique ou surun brin d'extension de I'amorce forme en ajoutantun ou plusieurs 
nucleotides ou analogues nucieotidiques a Tamorce ; 

(vii) diver le lieur clivable entre I'analogue nucieotidique qui a 6t6 incorpore dans le brin en croissance d'ADN 
et le marqueur unique ; 

30 (viii) diver le groupe chimique clivable coiffant le groupe -OH £ la position 3' du desoxyribose pour decoiffer le 

groupe -OH, et laver la surface solide pour retirer les composes cliv6s ; et 

(ix) r6peter les etapes (iii) a (viii) de maniere a determiner, pour chaque repetition, I'identite de I'analogue 
nucieotidique nouvellement incorpore dans le brin en croissance d'ADN ; 

35 dans lequel, si le marqueur unique est un colorant, Tordre dans lequel les etapes (v) £ (vii) sont effectu6es est : (v), 

(vi) et (vii) ; et 

si le marqueur unique est un marqueur de masse, I'ordre dans lequel les etapes (v) a (vii) sont effectu6es est : (vi), 

(vii) et (v). 

40 2. Proc6de selon la revendication 1, dans lequel la surface solide est du verre, du silicium ou de I'or. 

3. Precede selon la revendication 1 , dans lequel la surface solide est une bille magnetique, une puce, un canal dans 
une puce, ou un canal poreux dans une puce. 

45 4. Proc6de selon la revendication 1, dans lequel l'6tape d'attachement de I'acide nudeique a la surface solide 
comprend : 

(i) le revetement de la surface solide avec un groupe fonctionnel phosphine, 

(ii) I'attachement d'un groupe azido a I'extremite 5' de I'acide nucieique, et 

so (iii) ('immobilisation de Textremite 5' de I'acide nucieique sur la surface solide par interaction entre le groupe 

fonctionnel phosphine sur la surface solide et le groupe azido sur Textremite 5' de Tacide nucieique. 

5. Proc6d6 selon la revendication 4, dans lequel T6tape de revetement de la surface solide avec le groupe fonctionnel 
phosphine comprend : 

55 

(i) le revetement de la surface avec une amine primaire, et 

(ii) le couplage covalent d'un ester N-hydroxy-succinimidylique de triarylphosphine avec Tamine primaire. 
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6. Procede selon la revendi cation 1 , dans lequel I'acide nucleique qui est attache a la surface solide est un ADN simple 
brin. 

7. Procede selon la revendication 1, dans lequel I'acide nucleique qui est attache a la surface solide dans I'etape (i) 
5 est un ADN double brin, dans lequel un seul brin est directement attach6 a la surface solide, et dans lequel le brin 

qui n est pas directement attache a la surface solide est retire par denaturation avant de passer a I'etape (ii). 

8. Procede selon la revendication 1 , dans lequel I'acide nucleique qui est attache a la surface solide est un ARN, et 
la polymerase dans I'etape (iii) est une transcriptase inverse. 

w 

9. Procede selon la revendication 1 , dans lequel I'amorce est attachee a une extremite 3' de I'acide nucleique dans 
I'etape (ii) et dans lequel I'amorce attachee comprend une boucle stable et un groupe -OH a une position 3' d'un 
desoxyribose susceptible d'auto-amorcage dans la reaction de polymerase. 

15 10. Procede selon la revendication 1, dans lequel I'etape d'attachement de I'amorce a I'acide nucleique comprend 
I'hybridation de I'amorce avec I'acide nucleique ou la ligature de I'amorce a I'acide nucleique. 

11. Procede selon la revendication 1 , dans lequel Tun ou plusieurs de quatre analogues nucleotidiques differents sont 
ajoutes dans I'etape (iii), dans lequel chaque analogue nucleotidique different comprend une base differente choisie 

20 dans le groupe constitue par la thymine ou I'uracile ou un analogue de thymine ou d'uracile, i'adenine ou un analogue 

d'adenine, la cytosine ou un analogue de cytosine, et la guanine ou un analogue de guanine, et dans lequel chacun 
des quatre analogues nucleotidiques differents comprend un marqueur unique. 

12. Procede selon la revendication 1 , dans lequel le groupe chimique clivable qui coiffe le groupe -OH a la position 3' 
25 du desoxyribose dans I'analogue nucleotidique est -CH 2 CH=CH 2 - 

13. Procede selon la revendication 1 , dans lequel le marqueur unique qui est attache a ('analogue nucleotidique est un 
groupe fonctionnel fluorescent ou un cristal semi-conducteur fluorescent. 

30 1 4. Procede selon la revendication 1 3, dans lequel le groupe fonctionnel fluorescent est choisi dans le groupe constitue 
par la 5-carboxyfluoresceine, la 6-carboxyrhodamine-6G, la N,N,N',N'-tetramethyl-6-carboxyrhodamine et la 6-car- 
boxy-X-rhodamine. 

15. Procede selon la revendication 1 , dans lequel le marqueur unique qui est attache a I'analogue nucleotidique est un 
35 marqueur de transfert d'energie de fluorescence qui comprend un donneur de transfert d'energie et un accepteur 

de transfert d'energie. 

16. Procede selon la revendication 15, dans lequel le donneur de transfert d'energie est la 5-carboxyfluoresceine ou la 
cyanine, et dans lequel I'accepteur de transfert d'energie est choisi dans le groupe constitue par la dichlorocarboxy- 

40 fluoresceine, la dichloro-6-carboxyrhodamine-6G, la dichloro-N.N.N'.N'-tetramethyl-e-carboxyrhodamine, et la di- 

chloro-6-carboxy-X-rhodamine. 

17. Procede selon la revendication 1 , dans lequel le marqueur unique qui est attache a ('analogue nucleotidique est un 
marqueur de masse qui peut etre detecte et differencie par un spectrometre de masse. 

45 

18. Procede selon la revendication 17, dans lequel le marqueur de masse est choisi dans le groupe constitue par un 
groupe 2-nitro-a-methyl-benzyle, un groupe 2-nitro-a-methyl-3-fluorobenzyle, un groupe 2-nitro-a-methyl-3,4-di- 
fluorobenzyle, et un groupe 2-nitro-a-methyl-3,4-dimethoxybenzyle. 

50 19. Procede seion la revendication 1 , dans lequel le marqueur unique est attache par le biais d'un lieur clivable a une 
position 5 d'une cytosine ou d'une thymine ou a une position 7 d'une deaza-adenine ou d'une deaza-guanine. 

20. Procede selon la revendication 1 , dans lequel le lieur clivable entre le marqueur unique et I'analogue nucleotidique 
est clive par un moyen choisi dans le groupe constitue par I'un ou plusieurs parmi un moyen physique, un moyen 

55 chimique, un moyen physico-chimique, la chaleur, et la lumiere. 

21. Procede selon la revendication 20, dans lequel le lieur clivable est un lieur photoclivable qui comprend un groupe 
fonctionnel 2-nitrobenzyle. 
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22. Procede selon la revendication 1 , dans lequel le groupe chimique clivable utilise pour coiffer le groupe -OH a la 
position 3' du desoxyribose est dive par un moyen choisi dans le groupe constitue par Tun ou plusieurs parmi un 
moyen physique, un moyen chimique, un moyen physico-chimique, la chaleur, et la lumiere. 

23. Procede selon la revendication 1, dans lequel les composes chimiques ajoutes dans I'etape (vi) pour coiffer en 
permanence tout groupe -OH n'ayant pas reagi sur I'amorce attachee a I'acide nucleique ou sur le brin d'extension 
de ramorce sont une polymerase et un ou plusieurs did6soxynucleotides ou analogues de didesoxynucleotides 
differents. 

24. Procede selon la revendication 23, dans lequel les didesoxynucleotides differents sont choisis dans le groupe 
constitue par la 2',3'-didesoxyadenosine 5'-triphosphate, la 2\3'-didesoxyguanosine 5' -triphosphate, la 2',3'-dide- 
soxycytidine S'-triphosphate, la 2',3'-didesoxythymidine 5'-triphosphate, la 2',3'-didesoxyuridine 5'-triphosphate, et 
leurs analogues. 

25. Procede selon la revendication 1, dans lequel une polymerase et I'un ou plusieurs de quatre didesoxynucleotides 
differents sont ajoutes dans I'etape (vi), et dans lequel chaque didesoxynucleotide different est choisi dans le groupe 
constitue par la 2',3'-didesoxyadenosine 5'-tri phosphate ou un analogue de 2\3'-didesoxyadenosine 5- 
triphosphate ; la 2\3 , -didesoxyguanosine 5'-triphosphate ou un analogue de 2\3'-didesoxyguanosine 5'- 
triphosphate ; la 2',3*-didesoxycytidine 5'-triphosphate ou un analogue de 2',3'-didesoxycytidine 5'-tri phosphate ; 
et la 2\3*-didesoxythymidine 5'-triphosphate ou la 2',3'-didesoxyuridine 5'-tri phosphate ou un analogue de 2\3'- 
didesoxythymidine 5'-tri phosphate ou un analogue de 2',3'-didesoxyuridine 5'-triphosphate. 

26. Procede selon la revendication 17, dans lequel le marqueur de masse est detecte en utilisant un systeme de 
spectrometrie de masse parallele qui comprend une pluralite de spectrometres de masse a ionisation chimique a 
pression atmospherique pour analyse parallele d'une pluralite d'echantillons comprenant des marqueurs de masse. 

27. Procede de sequencage simultane d'une pluralite d'acides nucleiques differents, qui comprend I'application simul- 
tanee du procede selon la revendication 1 a la pluralite d'acides nucleiques differents. 

28. Utilisation du procede selon la revendication 1 ou 27 pour la detection de polymorphismes de nucleotides uniques, 
I'analyse de mutations genetiques, I'analyse en serie de I'expression de genes, I'analyse de I'expression de genes, 
I'identification dans des expertises medico-legales, les etudes dissociation de maladies genetiques, le sequencage 
d'ADN, le sequencage genomique, I'analyse traductionnelle, ou I'analyse transcriptionnelle. 

29. Procede d'attachement d'un acide nucleique a une surface solide qui comprend : 

(i) le revetement de la surface solide avec un groupe fonctionnel phosphine, 

(ii) I'attachement d'un groupe azido a I'extremite 5' de I'acide nucleique, et 

(iii) Timmobilisation de I'extremite 5' de I'acide nucleique sur la surface solide par interaction entre le groupe 
fonctionnel phosphine sur la surface solide et le groupe azido sur I'extremite 5' de I'acide nucleique. 

30. Procede selon la revendication 29, dans lequel I'etape de revetement de la surface solide avec le groupe fonctionnel 
phosphine comprend : 

(i) le revetement de la surface avec une amine primaire, et 

(ii) le couplage covalent d'un ester N-hydroxy-succinimidylique de triarytphosphine avec I'amine primaire. 

31. Procede selon la revendication 29, dans lequel la surface solide est du verre, du silicium ou de Tor. 

32. Procede selon la revendication 29, dans lequel la surface solide est une bille magnetique, une puce, un canal dans 
une puce, ou un canal poreux dans une puce. 

33. Procede selon la revendication 29, dans lequel I'acide nucleique qui est attache a la surface solide est un ADN 
simple brin, un ADN double brin ou un ARN. 

34. Procede selon la revendication 33, dans lequel I'acide nucleique est un ADN double brin et un seul brin est attache 
a la surface solide. 
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35. Procede selon la revendication 34, dans lequel le brin de I'ADN double brin qui n'est pas attache £ la surface solide 
est retire par denatu ration. 

36. Utilisation du procGde selon la revendication 29 pour attacher un acide nucleique a une surface solide pour analyse 
5 de I'expression de genes, analyse de I'expression de genes basee sur des puces d ADN, detection de mutations, 

analyse traductionnelle, ou analyse transcriptionnelle dudit acide nucleique. 

37. Analogue nucleotidique qui comprend : 

10 (a) une base choisie dans le groupe constitue par I 'adenine ou un analogue d'adenine, la cytosine ou un analog ue 

de cytosine, la guanine ou un analogue de guanine, la thymine ou un analogue de thymine, et I'uracile ou un 
analogue d'uracile ; 

(b) un marqueur unique attache par le biais d'un lieur clivable & la base ou d un analogue de la base ; 

(c) un desoxyribose ; et 

*5 (d) un groupe chimique clivable pour coiffer un groupe -OH a une position 3* du desoxyribose, dans lequel le 

groupe chimique clivable est -CH 2 OCH 3 ou -CH 2 CH=CH 2 . 

38. Analogue nucleotidique selon la revendication 37, dans lequel le groupe chimique clivable qui coiffe le groupe -OH 
h la position 3' du desoxyribose dans I'analogue nucleotidique est -CH 2 CH=CH 2 . 

20 

39. Analogue nucleotidique selon la revendication 37, dans lequel le marqueur unique est un groupe fonctionnel fluo- 
rescent ou un cristal semi-conducteur fluorescent. 

40. Analogue nucleotidique selon la revendication 39, dans lequel le groupe fonctionnel fluorescent est choisi dans le 
25 groupe constitue par la 5-carboxyfluoresceine, la 6-carboxyrhodamine-6G, la N,N,N',N'-tetramethyl-6-carboxyrho- 

damine, et la 6-carboxy-X-rhodamine. 

41. Analogue nucleotidique selon la revendication 37, dans lequel le marqueur unique est un marqueur de transfert 
d'energie de fluorescence qui comprend un donneur de transfert d'energie et un accepteur de transfert d'energie. 

30 

42. Analogue nucleotidique selon la revendication 41, dans lequel le donneur de transfert d'energie est la 5-carboxy- 
fluoresceine ou la cyanine, et dans lequel I'accepteur de transfert d'energie est choisi dans le groupe constitue par 
la dichlorocarboxyfluoresceine, la dichloro-6-carboxyrhodamine-6G, la dichloro-N.N.N'.N'-tetramethyl-S-carboxy- 
rhodamine, et la dichloro-6-carboxy-X-rhodamine. 

35 

43. Analogue nucleotidique selon la revendication 37, dans lequel le marqueur unique est un marqueur de masse qui 
peut §tre detecte et difference par un spectrometre de masse. 

44. Analogue nucleotidique selon la revendication 43, dans lequel le marqueur de masse est choisi dans le groupe 
40 constitue par un groupe 2-nitro-a-methyl-benzyle, un groupe 2-nitro-a-methyl-3-fluorobenzyle, un groupe 2-nitro- 

a-methyl-3,4-difluorobenzyle, et un groupe 2-nitro-a-methyl-3,4-dimethoxybenzyle. 

45. Analogue nucleotidique selon la revendication 37, dans lequel le marqueur unique est attache par le biais d'un lieur 
clivable d une position 5 d'une cytosine ou d'une thymine ou £ une position 7 d'une deaza-adenine ou d'une deaza- 

45 guanine. 

46. Analogue nucleotidique selon la revendication 37, dans lequel le lieur entre le marqueur unique et I'analogue nu- 
cleotidique est clivable par un moyen choisi dans le groupe constitue par I'un ou plusieurs parmi un moyen physique, 
un moyen chimique, un moyen physico-chimique, la chaleur, et la lumiere. 

50 

47. Analogue nucleotidique selon la revendication 46, dans lequel le lieur clivable est un lieur photoclivable qui comprend 
un groupe fonctionnel 2-nitrobenzyle. 

48. Analogue nucleotidique selon la revendication 37, dans lequel le groupe chimique clivable utilise pour coiffer le 
55 groupe -OH a la position 3' du desoxyribose est clivable par un moyen choisi dans le groupe constitue par I'un ou 

plusieurs parmi un moyen physique, un moyen chimique, un moyen physico-chimique, la chaleur, et la lumiere. 

49. Analogue nucleotidique selon la revendication 37, ('analogue nucleotidique etant choisi dans le groupe constitue par : 
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et 



^^^^^^ • 




dans lesquels Dye 1t Dye 2 , Dye 3 et Dye 4 sont quatre marqueurs colorants differents ; et 
dans lesquels R est -CH 2 OCH 3 ou -CH 2 CH=CH 2 . 

50. Analogue nucl6otidique selon la revendlcation 49, ('analogue nucleotidique etant choisi dans le groupe constitue par 
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et 




» 



• 



5 



dans lesquels R est -CH 2 OCH 3 ou -CH 2 CH=CH 2 . 
51 . Analogue nucleotidique selon la revendication 37, 1'analogue nucleotidique etant choisi dans le groupe constitue par 
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A- 6- 6- vr^> 



dans lesquels Tag 1( Tag 2 , Tag 3 et Tag 4 sont quatre marqueurs de masse differents ; et 
dans lesquels R est -CH 2 OCH 3 ou -CH 2 CH=CH 2 . 

52. Analogue nucleotidique selon la revendlcation 51 , 1'analogue nucleotidique etant choisi dans le groupe constitue par 
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D-P-O-0-O-P-O 

6- 6- 6 
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et 
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6- o- o 



^^^^ ; 



dans lesquels R est -CH 2 OCH 3 ou -CH 2 CH=CH 2 . 

53. Utilisation de I'analogue nucleotidique selon la revendication 37 pour la detection de polymorphismes de nucleotides 
uniques, I'analyse de mutations genetiques, I'analyse en serie de I'expression de genes, ('analyse de I'expression 
de genes, identification dans des expertises medico-legales, les etudes dissociation de maladies genetiques, le 
sequencage d'ADN, le sequencage genomique, I'analyse traductionnelle, ou I'analyse transcriptionnelle. 
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R * H. CH 2 OCH 3 (MOM) or CH 2-CH^CH 2 (A»yl) 
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FIGURE 8 
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FIGURE 16 
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FIGURE 19 
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